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Unfolding  (estimating  a  particle  else  distribution  la  an  old  and 
well  known  problem  in  stereo logy  which-has  not  yet  been  solved  satis¬ 
factorily.  A  description  of  the  problem  is  as  follows:  spherical  parti¬ 
cles  are  dispersed  randomly  In  a  three-dimensional  body.  The  classical 
assumption  is  that  the  centers  of  the  spheres  are  distributed  according 
to  a  dilute  Poisson  process.  The  radii  of  such  spheres  have  a  distribu¬ 
tion  G  independent  of  everything  else.  A  random  probe  (line,  plane, 
or  thin  slice)  is  cut  through  the  volume.  The  observations  y  are  then 
the  following: 

i)  For  the  random  line,  its  intersection  with  a  sphere  is  a  line 
segment,  y  is  half  the  length  of  that  segment. 

il)  For  the  random  plane,  y  is  the  radius  of  the  circle  of 
intersection. 

ill)  For  the  thin  slice,  y  is  the  profile  radius,  that  is,  the 
maximum  of  the  radii  of  the  circles  of  intersection.  The  problem  is 
to  estimate  the  distribution  G  from  the  observations  y  ^ , . . . ,  y^ . 

Past  procedures  make  use  of  the  inversion  formula  that  expresses 
the  particle  size  distribution  G  as  a  function  of  the  distribution 
H  of  the  observed  data.  An  estimate  G  is  obtained  by  replacing  H 
in  the  formula  by  an  estimate  H  where  H  is  either  the  sample  c.d.f. 
or  a  smoothed  version  of  it.  These  procedures  do  not  take  the  structure 
of  the  problem  into  account.  Consequently,  they  have  the  following 
shortcomings: 

i)  H  does  not  belong  to  the  admissible  range  of  H  with  the 
result  that  G  is  not  a  distribution  function. 


li)  The  same  estlmete  of  H  is  used  regardless  of  the  kind  of 
data  (linear,  planar  or  thin  slice). 

ill)  They  cannot  Incorporate  additional  information  concerning  G. 

In  this  dissertation,  we  propose  a  new  procedure  from  the  viewpoint 
of  nonparametrlc  estimation  of  mixing  distributions  (see  sections  1.4, 
1.5,  1.7).  In  section  1.9,  we  show  how  the  new  approach  can  deal  with 
all  three  shortcomings  of  the  classical  procedures.  We  consider  linear, 
planar,  and  thin  slice  data  (chapters  II,  III  and  IV  respectively).  In 
all  three  cases,  our  approach  performs  better  than  the  classical  proce¬ 
dures  (sections  2.4,  3.6,  4.8).  In  addition,  we  prove  consistency 
results  (sections  2.3,  3.5,  4.7). 

In  the  random  line  case,  the  nonparametrlc  maximum  likelihood 
estimate  (MLE)  of  G  is  derived  (section  2.3).  However,  because  of 
a  "peaking"  effect  of  the  MLE  near  the  mode,  we  obtain  extremely  poor 
simulation  results.  By  modifying  the  procedure  slightly,  we  are  able 
to  obtain  much  better  results  (section  2.4). 

The  importance  of  having  H  lie  within  the  admissible  range  is 
highlighted  In  the  random  plane  case  where  we  want  to  bootstrap  the 
distribution  of  a  stereological  estimate.  In  section  3.7,  we  show 
that  it  is  crucial  to  resample  from  an  estimate  H  that  belongs  to 
the  admissible  range,  since  the  sample  c.d.f .  does  not,  the  usual  way 
of  resampling  from  the  sample  c.d.f.  will  not  work.  We  also  indicate 
how  bootstrap  methods  can  contribute  to  stereology. 


In  the  thin  alien  case,  thn  formulas  derived  In  the  literature  are 
valid  only  when  G  is  continuous.  In  section  4.2,  we  derive  the  correct 
formulas  when  G  is  mixed  or  discrete.  These  formulas  Involve  a  decom¬ 
position  of  H  Into  its  continuous  and  discrete  component .  This  makes 
the  estimation  problem  more  complicated  but  aleo  more  interesting  especially 
In  the  discrete  case.  We  propose  a  few  procedures  which  involve  a  decom¬ 
position  of  the  data  correepondlng  to  that  of  H  (sections  4.3,  4.5,  4.7). 
The  discrete  case  is  more  interesting  because  of  the  following: 

1)  There  are  two  Inversion  formulas  from  which  we  can  derive  an 
estimate  of  G. 

11)  Part  of  the  thin  slice  data  can  be  treated  as  planar  data, 
ill)  The  support  of  G,  if  it  is  flnir  ran  be  determined  from 
the  data  with  high  probability. 


CHAPTER  I 


GENERAL  THEORY 


Description  of  the  stereolosical  problem. 


Spherical  particles  are  dispersed  randomly  in  a  three-dimensional 
body.  The  classical  assumption  is  that  the  centers  of  the  spheres  are 
distributed  according  to  a  dilute  Poisson  process.  The  radius  B  of 
such  spheres  is  distributed  according  to  G  independent  of  everything 
else.  A  random  probe  (line,  plane,  or  thin  slice)  is  cut  through  the 
volume.  The  observed  y’s  are  the  following: 

i)  For  the  random  line,  its  intersection  wich  a  sphere  is  a  line 
segment,  y  is  half  the  length  of  that  segment. 

ii)  For  the  random  plane,  y  is  the  radius  of  the  circle  of  inter 
section. 

ill)  For  the  th_.  slice,  y  is  the  profile  radius,  that  is,  the 
maximum  of  the  radii  of  the  circles  of  intersection.  The  problem  is 
to  estimate  the  distribution  G  from  observations  ’  *^n* 

Stoyan  (1979)  calculated  that  the  Poisson  process  has  to  be  very 
dilute  in  order  for  the  probability  of  the  spheres  to  overlap  to  be 
small.  Thus  the  classical  Poisson  model  was  rather  unrealistic. 
Recently,  Mecke  and  Stoyan  (1980)  showed  that  all  the  formulas  derived 
under  the  classical  assumption  of  dilute  Poisson  process  also  hold 
under  more  general  and  realistic  conditions.  Their  model  is  the 
following:  let  {(x^Bj)}  be  a  marked  point  process  where  {x^}  is 
any  stationary  point  process  (the  centers)  and  {6^}  are  marks  (the 
radii) .  This  model  allows  dependencies  between  diameters  and  midpoint 
distances,  situations  that  usually  occur  in  nature. 


!.  The  classical  procedures. 

The  classical  procedures  make  use  of  the  inversion  formula  that 
expresses  the  particle  size  distribution  G  as  a  function  of  the  distri- 
but ion  H  or  density  h  of  the  observed  data.  An  estimate  G  is 

*w  «s. 

obtained  by  replacing  H(h)  in  the  formula  by  an  estimate  H  (h) . 

1.2a.  Random  line. 

The  Inversion  formula  is 


(1.2.1) 


1  -  G(6-) 


1  h(6) 

0  h'(0) 


where  G(0-)  “  11m  G(t)  . 

t+e 


So,  an  estimate  is 


(1.2.2)  1  -  C<8-)  -  £ 

0  h' (0) 


where  h  is  the  spectral  derivative  of  Hn, 
the  sample  c.d.f.  based  on  y^,...,yfl. 


Note:  Spectral  differentiation  is  a  stable  numerical  differentiation 


procedure  developed  in  a  time  series  context  (Anderssen  and  Bloomfield, 


1973) .  For  further  discussion,  see  page 


1.2b.  Random  plane. 

The  inversion  formula  is 


(1.2.3) 


1  -  G(0) 


dH(y) 
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So,  an  estimate  Is 


6 


For  further  details,  see  section  2.2  (random  line),  section  3.2  (random 
plane)  and  section  4.2  (thin  slice). 


1.3.  The  basic  problem  with  the  classical  procedures. 

We  must  remember  how  H  comes  about.  H  is  not  just  any  arbitrary 
distribution  but  it  is  the  distribution  of  the  data  observed  through  the 
random  probe.  In  a  way  that  depends  on  the  sampling  mechanism  (random 
line,  plane  or  thin  slice),  H  is  related  to  the  particle  size  distri¬ 
bution  6.  Hence  H  ■  Hq.  In  other  words,  H  has  to  belong  to  the 
admissible  range  {H^}  and  this  admissible  range  depends  on  the  sampling 
mechanism. 

The  basic  problem  with  the  classical  procedures  is  that  they  do  not 
take  the  structure  of  the  problem  into  account.  They  estimate  H  by  the 
sample  c.d.f .  or  a  smoothed  version  of  it  and  then  substitute  this  estimate 
H  into  the  inversion  formula.  This  leads  to  the  following: 

<v  «*» 

i)  H  j.  (Hg)  with  the  result  that  G  is  not  a  distribution  function. 
For  example,  in  the  random  plane  case,  the  back  substitution  procedure 
uses  H  -  Hq,  the  sample  c.d.f.  (1.2.4)  becomes 

l  — * - 

i  -  Gn(e)  - - 1 — 

As  an  estimate  of  G,  Gq  has  the  following  undesirable  properties: 

1)  1-G^  is  increasing  in  each  of  the  intervals  [y^,yi+j)  where 
we  assume  the  y’s  are  already  ordered. 

2)  1— G  has  singularities. 


Arguing  backward,  sine*  G  is  not  a  distribution  function,  H  t  (H_) , 

n  n  u 

and  this  is  the  basic  pro  Has  of  back  substitution.  There  is  no  reason 

to  believe  that  if  we  smooth  H  to  H  ,  H  e  {H0} .  Thus  Anderssen 

n  n  n  v> 

and  Jakeman's  procedure  has  the  saae  basic  problem  (see  section  3.3). 

ii)  The  saae  estimate  of  H  is  used  regardless  of  the  sampling 
mechanism  and  the  kind  of  data  (linear,  planar  or  thin  slice). 

ill)  They  cannot  incorporate  additional  information  concerning  G. 
Such  additional  information  is  available  in  the  thin  slice  case,  where 
if  G  is  discrete  with  finite  support,  the  support  can  be  determined 
from  the  data  with  high  probability  (see  section  4.5). 

1.4.  Formulation  of  the  problem  of  estimating  mixing  distributions. 

Let 


yl,y2* 


•yr 


i.i.d.  'v  PQ 


where 


PQCy)  -  /  F0(y)  dQ(0)  . 

Q(0)  is  called  the  mixing  distribution.  {F0}  is  a  parametric  family 
of  distributions. 

If  F0  has  density  f0,  then  Fp  has  a  density  fp(y)  •  /  fg(y)dQ(0) 
but  it  is  possible  that  Fp  has  density  even  though  F0  has  no  density. 
The  problem  is  to  estimate  Q(0)  from  y^*...,yn. 


For  the  estimation  problem  to  make  sense,  the  ident  if  lability  condi¬ 
tion  has  to  be  satisfied,  l.e. 


1.5.  The  connection  between  the  stereological  problem  and  estimation 
of  mixing  distributions. 

Let  6(8)  be  the  distribution  of  the  radii.  The  arrow  denotes 

the  event  that  a  sphere  is  hit  by  the  random  probe.  Then 

H(y]6,+)  is  the  conditional  distribution  of  y  given  that  a 
sphere  of  radius  8  is  hit  by  the  probe. 

P(+|8)  is  the  probability  that  a  sphere  of  radius  8  is  hit  by 
the  probe. 

H(y | +)  is  the  distribution  of  the  observed  y's.  The  arrow 

is  there  since  y  is  measured  only  when  a  sphere  is  hit  by 
the  probe. 

Thus 


(1.5.1) 


H(y| +)  -  /  H(y|8,+)  dG(8|+) 


but 

(i.5*2)  aocelo  - 

SO 

(1.5.3)  H(y|t)  -  !  d°W  . 

/  P(+|8)  dG(8) 
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As  we  will  see  shortly,  in  ell  three  cases  (random  line,  plane  and  thin 
slice) , 

H(y | 9,+)  is  a  parametric  family 

P(tjd)  is  also  known. 

Therefore  (1.5.1)  tells  us  that  G(0|+)  is  a  mixing  distribution  (in  the 
notation  of  section  1.4,  Q(0)  ■  G(0|+),  Fg(y)  ■  H(y|0,+)).  Then  (1.5.2) 
tells  us  that  estimating  G(0|+)  is  equivalent  to  estimating  G(0) .  He 
note  (1.5.3)  is  the  formula  on  which  stereologists  have  been  focusing  their 
attention,  where  P(+|0)  always  cancels  with  a  term  in  the  denominator  of 
H(y|0,+),  and  the  resulting  integral  equation  can  be  inverted  to  give 
G(*)  in  terms  of  H(*|i)  or  h(*|t).  Call  that  relationship  (1.5.4)  for 
ease  of  reference.  Because  of  the  possibility  of  cancellation  and  inver¬ 
sion,  no  attention  is  paid  to  (1.5.1).  Consequently,  stereologists  have  for 
years  overlooked  the  fact  that  G(8],f)  is  a  mixing  distribution.  Perhaps 
this  is  understandable  since  it  seems  unlikely  one  would  not  cancel  when 
cancellation  is  possible.  Instead  of  using  only  (1.5.4)  (see  section  1.2), 
our  proposal  is  that  (1.5.1)  should  also  be  used  along  with  (1.5.4)  since 
they  complement  one  another. 

He  now  look  at  the  three  cases  more  closely,  writing  out  (1.5.1),  (1.5.2), 
(1.5.3),  (1.5.4)  explicitly.  Their  derivations  are  elementary,  and  the 
interested  reader  is  referred. to  Coleman  (1979). 

1.5a.  Random  line. 

j  0  <  y  <  0 

H(yie,t)  has  density  h(yle,t)  -  <  6 

V.  0  otherwise 

p(t|e)  «  02  . 


10 


So,  wo  hav* 


(1.5.1a)  h(y|+)  -  f  3f-dG(0|+) 

iy  eZ 

(1.5.2a)  dG(6 1  +)  -  -el..dcIS> 

y2G 

whore  m2(J  »  /  02  dG(6) . 


(1.5.3a) 


2 

Notice  the  cancel let Ion  of  8  . 
(1.5.3a)  can  be  Inverted  to 


(1.5.4a) 


"  G<0“>  "i^foT+T  * 


1.5b.  Hendon  plane. 


H(yj0,t)  has  density  h(y|0,f)  -  y 


B/62-y2 


0  <  y  <  8 


otherwise 


P(+j0)  «  0  • 


So  we  have, 


(1.5.1b) 


My|+) 


-c 


)A2-y2 


dG(0|+) 
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(1.5.2b) 


welt)  -lijtfai 

mg 


where  y_  ■  /0  dG(0)  . 

V 


(1.5.3b) 


h(y|+)  -  J 


—±=z  jrdoce) 

*  AV  0 


Notice  the  cancellation  of  0. 


(1.5.3b)  can  be  Inverted  to 


(1.5.4b)  1  -  G(0)  -  f  dH(yl+)/  (  “  dH(y|+) 

in  /  2  .2  )n  y 


0/y2-02 


1.5c.  Thin  slice  of  thickness  2t. 


1  -  H(y|0,t)  - 


^V+ 


0  £  y  <  0 


y  £  0  . 


Note:  H(y | 0,+)  has  a  jump  at  0  which  can  be  explained  physically. 
One  consequence  is  that  the  formulas  derived  In  the  literature  are 
valid  only  whan  G  Is  continuous  (see  section  4.2). 


P(+| 0)  «  0  +  T 


So,  we  have 
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(1.5.1c)  1  -  H(y|+)  -  -yg--/  T  dG(8|+) 

Jy 


(1.5.2c)  dG(0|f)  - 

VG 


(1.5.3c) 


/e2-r2  +  T 

wg  +  t 


dG(0) 


Notice  the  cancellation  of  0+T  . 


If  G  Is  continuous,  (1.5.3c)  can  be  Inverted  to 


(1.5.4c) 


1  -  G(0)  - 


, —  w^/2 

where  f  (w)  -  /2ir  e  '  (l-$(w))  and  0  Is  the  standard  normal  distri¬ 

bution  function. 

If  G  is  mixed  or  discrete,  the  correct  formulas  Involve  a  decom¬ 
position  of  H  into  its  discrete  and  continuous  component  (see  section 
4.2).  This  makes  the  estimation  problem  more  complicated  but  also  more 
Interesting,  especially  In  the  discrete  case.  The  discrete  case  Is 
Interesting  because  of  the  following: 

1)  There  are  two  inversion  formulas  from  which  we  can  derive  an 


J]  *(’ >)dB(yl+) 

j"  f(^)dH(y|+) 


estimate  of  G 


'■V 

,  *  I 

v\ 


5 


it)  Part  of  the  thin  slice  data  can  be  treated  as  planar  data. 
Ill)  The  support  of  G,  if  it  is  finite,  can  be  determined  from 
the  data  with  high  probability. 

For  details,  see  section  4.5. 


Note:  The  case  with  truncation  (i.e.  only  y  ^  y0  is  observed)  can  be 
reduced  to  the  case  without  truncation  (see  sections  2.5,  3.8). 


- 


1.6.  Summary  of  notations. 

a.  For  mixture  of  distributions. 

Q(6)  denotes  the  mixing  distribution. 

(FgC*)}  is  the  parametric  family  of  dlstri hit ions  that  we  are  mixing. 
ifg(*)}  are  the  densities  of  (F^(*)}  when  they  exist. 

(1.6.1)  FQ(y)  -  /  Fg(y)  dQ(0) 

fQ(y)  -  /  fe(y)  dQ(0)  if  f0  e:..u*s 

■  ^  /F0(y)  dQ(0)  in  general. 

b.  For  the  stersologlcal  problem. 

G(0)  is  the  distribution  of  the  radii  of  the  spheres. 

G(0|f)  -  distribution  of  6  given  the  sphere  is  hit  by  the  probe. 
H(y|0,t)  -  distribution  of  tha  observed  y  given  that  a  sphere  of 
radius  0  is  hit  by  the  random  probe. 

H(y|t)  -  distribution  of  the  observed  y's.  is  there  since  y  is 
observed  only  when  the  probe  hits  a  sphere. 

P(+|8)  -  probability  that  a  sphere  of  radius  0  is  hit  by  the  random 
probe. 

Again,  small  letters  g,  h  denotes  densities  when  they  exist. 
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employed 


Since  H(y|+)  “  /  H(y|0,+)  dG(0|+).  By  comparing  with  (1.6.1) 


Stereological  Problem 


Similar  Identification  for  densities 


Mixture  of  distributions 


G(0|+) 

<-> 

Q(B) 

H(yle.i) 

<— > 

p0(y) 

H(y|t) 

<-c> 

Vy) 

h(y|0,+) 

h(y|+) 


f0(y) 

fQ(y) 


1.7.  The  new  approach. 

Recall  the  following:  (1.5.1)  tells  ua  that  G(6|t)  is  a  mixing 
distribution.  (1.5.2)  tells  us  that  estimating  G(0|t)  is  equivalent 
to  estimating  G(0) .  These  motivate  the  following  procedure: 

Step  1.  Estimate  the  mixing  distribution  G(6[+),  call  that 
estimate  G(0|t). 

Step  2.  Obtain  G(0)  from  G(0[+)  using  (1.5.2). 

Sometimes  we  obtain  H(yjt)  -  F„(y)  more  directly  than  G(6|+)  •  Q(0). 

Q 

In  that  case,  we  propose  an  alternate  procedure  which  illustrates  how 
(1.5.1)  and  (1.5.4)  complement  one  another. 

Step  1.  Obtain  H(y|+)  ■  F  (y) . 

Q 

Step  2.  Substitute  H  into  the  Inversion  formula  (1.5.4)  to  obtain 
This  procedure  works  nicely  in  the  random  line  case  (see  section  2.3). 
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1.8.  Nonparametric  estimation  of  mixing  distributions. 


The  purpose  of  this  section  Is  to  describe  some  nonparametric 
procedures  which  we  will  apply  later  (either  directly  or  with  slight 
modification) . 

For  notations,  see  section  1.6. 

1.8a.  Nonparametric  maximum  likelihood  estimate  (HLE) . 

Assuming  F_  has  density  .  Then  F.  has  density  f., 
fQ(y)  -  /  f0(y)  dQ(0).  We  have  y1»y2» • *  * »yn*  ^  The 

procedure  is  to  find  f  that  maximizes  II  ^  f(y^)  subject  to 
f  c  {fql 

We  now  remark: 

1)  Maximizing  II  ^  f(yA)  determines  f  only  up  to  f (y^), . . .,f (yn)» 
further  argument  Is  needed  in  each  special  case  to  show  that  they  determine 
f  uniquely. 

A  A 

11)  By  Ident  If  la  faUlty,  f  ■  f*  determines  Q,  but  this  is  only  an 

A  A 

existence  theorem,  so  we  need  a  procedure  for  finding  Q  from  f .  This 
may  not  be  easy  In  general,  but  for  the  problem  of  unfolding  particle 
else  distribution,  something  nice  happens.  Recall  that  (1.5.1)  tells  us 
we  have  a  problem  in  estimating  a  mixing  distribution  by  the  identifica¬ 
tion  F0(y)  »  H(y|8,  i)  and  Q(0)  -  G(0|  +  ),  Fp(y)  -  H(y|  +  )  but,  after 
all,  it  is  G<9),  not  G(6|  +  ),that  we  are  after,  and  (1.5.4)  gives  G(*) 
in  terms  of  H('*|t). 

The  above  illustrates  how  (1.5.1)  and  (1.5.4)  complement  one  another. 
(1.5.1)  provides  the  point  of  view  that  the  distribution  of  y  is  a 
mixture  which  we  can  estimate  using  existing  methods.  Then  (1.5.4) 
performs  the  inversion  for  us  to  give  an  estimate  of  G. 


ill)  For  finite  mixture  with  known  support  points,  we  can  use  the  EM 
algorithm  (Dempster,  Laird  and  Rubin,  1977)  to  find  Q  directly  without 
the  intermediate  step  £.  This  is  useful  since  frequently,  a  mixture 
problem  can  be  reduced  to  a  finite  mixture  with  known  support  points. 

We  now  give  a  brief  description  of  the  EM  algorithm  in  its  general 
setting.  Let 

(y,z)  ^  fA(y,z) 

V~'*>  "  108 

where  y  are  the  observed  data,  (y,z)  are  the  complete  data.  Define 

qC4>* |4>>  -  E$(*4>»(y,;>ly>  • 

Each  iteration  -*•  of  the  EM  algorithm  involves  two  steps  which 

we  call  the  expectation  step  (E-step)  and  the  maximization  step  (M-step) . 
E-step :  Compute  Q($  |  ♦^  ) . 

M-step:  Choose  that  maximizes 

The  heuristic  Idea  here  is  that  we  would  like  to  choose  the  <J>*  that 
maximizes  &^(£,z).  Since  we  do  not  know  &^(y,z),  we  maximize  its 
current  expectation  instead  given  the  data  y  and  the  current  fit  . 

When  we  specialize  to  the  case  of  a  finite  mixture, 

^  m  (0^ , 02 ,  •  *  •  , 0 F^ , . . . , P|^ ^) 
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r-'- 


Let  Z±,  i  -  l,...,n  be  i.i.d.  P^Zj-j)  -  and  yjz^  j  ^  f^  (yj), 
i  *  l9...9n  Independent.  Then 


k  n 

I  V 

j-i  J  i 


n  fg  (yA)  where  n^ 


■  #  i  such  that  *  j  , 


*i«j 


and 


k  k 

l6(y,2)  ■  I  n  log  P.  +  J  l  log  fg  (y  ) 
9  ~  ~  j-1  3  J  j-1  1  1 


Vj 


depends  on  $  depends  on  <{> 

only  through  only  through 


P1 . Pk-1 


91 . 9k 


Thus  to  apply  the  EM  algorithm,  we  maximize  the  conditional  expectation 
of  the  two  sums  separately  given  y  and  4>v  ' .  In  particular, 

E  fnO(nJy) 

.3 

j  n 

P1(0>  f  fn,)(yi> 

.  n  *  84(m)  1 

.1  I  - J -  . 

n  i-1  r  (m)f  ,  } 

lii  P*  efl(B)(  A> 


if  e1#.. 


.  ,6fc  are  known,  it  reduces  to 


-i  i  - 

J  n  i-l  r 


P^n^f  (v  } 

n  PJ  t8jlV 


,-l  L  l 
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b)  Minimum  distance  method. 

Motivation:  Since  yi*y2****»yn  %  Fq  and  determines 

Q  by  id ent if lability,  all  we  need  to  do  is  to  obtain  an  estimate  F 

A 

of  Fn  from  y, ,y„,...,y  •  However,  we  must  remember  that  F  has 
j.  i  n 

to  belong  to  the  admlssibLe  range  {Fq}.  Thus,  for  example,  if  Fg 

is  continuous,  then  so  is  Frt  ■  /  F.  dQ(0)  regardless  of  Q.  So,  in 

Q  0 

this  case,  the  sdmple  c.d.f.  £  {F^}.  t^ie  nonparametric  MLE 

A 

of  F,  but  if  Q  is  the  nonparametric  MLE  of  Q,  then  the  corresponding 

FA  would  be  the  restricted  MLE  of  F  subject  to  Ft  {f«)  and  not  F  , 

Q  Q  n 

the  unrestricted  MLE.  Since  the  restricted  MLE  may  be  hard  to  find  and 
sometimes  can  be  inconsistent,  the  method  of  minimum  distance  is 
proposed. 

The  method  can  be  described  as  follows:  Among  all  Q  that  put 
positive  probability  at  n  points,  find  that  minimizes  d(Fq»Fn) 

where  d(* , •)  is  some  distance  function  and  is  the  sample  c.d.f. 

of  the  y's.  There  are  variations  due  to  the  choice  of  d(»,»)  and 
as  to  whether  the  n  points  of  support  are  pre-specif led  or  are 
among  the  parameters  in  the  minimization  problem. 
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it)  Choi  and  Bulgren's  approach. 

2 

d(F,F  )  *  /(F(y)-Fn(y))  dFn(y) ,  the  Wolfowitz  distance.  The 
n  points  of  support  ®in*  *  *  *  ’^nn  are  amon8  parameters. 

Under  the  regularity  conditions > 

1)  identif lability . 

2)  lim  F„(y)  ■  F^y)  exists,  F00(*)  is  not  a  distribution. 

0  -*•  a> 

3)  F0(y)  continuous  in  (0,y> 

both  approaches  give  a  consistent  estimate  of  Q. 


1.9.  The  advantages  of  the  new  approach  over  the  classical  approach. 

Our  new  approach  takes  the  structure  of  the  problem  into  account. 
Hatching  the  drawbacks  listed  in  section  1.3  of  the  classical  procedures, 
we  have  the  following: 

i)  He  (H  }  ■  F_  .  The  Importance  of  having  H  lie  within  the 

admissible  range  is  highlighted  in  the  random  plane  case  where  we  want 

to  bootstrap  the  distribution  of  a  stereological  estimate.  In  section 

3.7,  we  show  that  the  usual  way  of  resampling  from  the  sample  c.d.f .  in 

the  bootstrap  does  not  work.  We  find  out  that  it  is  crucial  to  resample 

from  H  belonging  to  the  admissible  range. 

ii)  For  different  kinds  of  data  (linear,  planar  or  thin  slice), 

H  is  the  mixture  of  different  parametric  family  of  distributions,  hence 

the  estimate  H  would  also  be  different. 

lii)  The  new  approach  can  easily  Incorporate  additional  information 

concerning  G  to  give  a  better  estimate.  For  example,  if  we  know  the 

support  of  G,  hence  also  of  Q  ■  G( • I t) ,  is  (0, ,...,0  ),  then  the  MLE 

l  m 
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Q  maximizes  E  °  f»(y..)  among  all  Q  that  have  support  {6, ,...,0  }, 

l«l  y  i  l  m 

(Instead  of  ^®in*  *  *  *  *  Similarly,  the  minimum  distance  estimate 

$  minimizes  d(Fn,F  )  among  all  Q  that  have  support  (0, . 0  }. 

x  n  lo 

Since  m  is  fixed  and  does  not  grow  with  the  sample  size  n,  the 
reduction  in  the  amount  of  computation  can  be  substantial. 


1.10.  Some  peculiar  features. 

It  is  worth  pointing  out  that  there  are  some  peculiar  features 
which  make  the  problem  unique  and  more  than  just  a  routine  special 
case  of  estimating  a  mixing  distribution. 

i)  The  integral  equation  (1.5.3)  can  be  inverted  to  (1.5. A)  which 
gives  G(*)  in  terms  of  H(*|+)  (h(*|+)  In  the  random  line  case).  Thus 
bacr  ibstltutlon  is  possible.  That  is,  we  can  just  find  an  estimate  of 
H(*f+)  (h(«|+)),  call  that  fi(*i+)(K(*|+)),  plug  it  into  (1.5. A)  and  it  will 
give  us  an  estimate  G  of  G.  In  particular,  if  H  *  H  ■  F^,  the 
restricted  MLE  (recall  the  identification  Q(0)  •  G(0|f), 

F0(y)  -  H(y|0,+),  Fp(y)  -  H(yj+)),  then  G  •  G,  the  nonparametrlc 
MLE  of  G.  A  work  of  caution  is  that  if  H  £  {F^},  ®  will  not  be  a 
distribution  function. 

il)  For  the  random  plane  case. 


h(y|6,+) 


— -f - 0  <  y  <  0 

^  0v4*-y2 

0  otherwise 


has  a  singularity  at  y  -  0. 
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ill)  For  the  thin  slice  case. 

H(y|0,+)  has  a  jump  at  y  •  0. 

iv)  As  a  problem  in  estimating  a  mixing  distribution,  the  Fg(y) 
i.e.  the  H(y|6,+)  in  the  three  cases  are  so  peculiar  that  no  one 
would  think  of  doing  these  special  cases  if  they  did  not  arise  out 
of  a  practical  application,  which  is  the  case  here. 
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CHAPTER  II 


THE  RANDOM  LINE  CASE 


2.1.  The  basic  formulas  . 


(2.1.1) 


hy(y|+)  -  P 


%  dG(0|+) 

y  e 


(2.1.2) 


dG(0i,).ei^l 


2G 


where  y2(J  -  /  B*  dG(0). 


(2.1.3) 


^(ylt)  -  I 
Jy 


■  dG(0) 


2G 


Equation  (2.1.3)  can  be  inverted  to  give 


1  -  G(0-)  - 


V8>+)  y2G 


0  2 


Taking  the  limit  as  0+0  gives  1  ■  b^(0|t) 


(2.1.4) 


x  V8l+) 

0  hJTopFT 


1  -  G(0-) 


K 


2.2.  Survey  of  the  Literature. 

Since  y1,y2,...,yn  l.l.d.  <\#  h^ylt),  we  can  obtain  seme  estimate 
hy(*|i)  from  the  date.  Then  an  estimate  of  G  is 


(2.2.1)  1  -  8(0-)  -  I  Vi!*i  . 

6  5j(0|t) 

This  procedure  Is  found  to  be  unsatisfactory.  Some  objections  are  the 
following : 

1)  The  choice  of  ^(*1+)  Is  arbitrary. 

H)  As  far  as  we  know,  no  stereologlst  attempts  to  find  the  whole 
density  h^- |t) .  They  Just  differentiate  Hn(y)  (the  sample  c.d.f. 
of  the  y's)  numerically  to  obtain  h(*|t)  on  an  even  grid.  The  numerical 
procedure  proposed  is  spectral  differentiation:  a  stable  procedure 
developed  in  a  time  series  context.  (Andarssan  and  Bloomfield,  1973). 

ill)  Since  h(* | t)  Is  evaluated  only  on  an  even  grid,  we  also  need 
to  estimate  h^(0|t)  which  is  very  herd  to  estimate  from  the  data, 
iv)  if  g(0)  has  density  g(6),  then  from  (2.1.4), 


-i  _d 

hj(0|+)  d0  1  0  > 


Thus  to  estimate  g(0)  starting  from  the  sample  c.d.f.  H  ,  we  need 

n 

to  obtain  the  spectral  derivative  of  order  2.  Simulation  results  of 
Jakaman  and  Anderssen  (1976)  for  a  blmodal  g(0)  shows  that  the  proce¬ 
dure  Is  unsatisfactory. 

v)  Another  factor  which  contributes  to  the  failure  Is  the  effect 
of  the  factor  in  the  neighborhood  of  0. 
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Because  of  Che  above,  many  writers  (e.g.  Nicholson,  1970,  WaCson, 
1971,  Moran,  1971,  Jakeaan  and  Andersaan,  1976)  have  advised  that  linear 


probe  data  should  be  avoided. 

2.3.  Nonpara— trie  MLB. 

2.3a.  Derivation  and  computation. 

The  nonparaaetric  MLE  of  G  can  be  derived  as  follows: 


1)  Make  the  transformation  Z  ■  Y2. 

Since 

yy|M>  - 

0  <  y  <  0 

h^sle, t)  -  -y 

0  <  z  <  0 

So 

hz(zi+)  -  r 

4  deoK)  • 

e 

Si 

ii)  Let 

i  -  e2  . 

Then 

h2(.i+, -  f  i 

dG(C|+) 

which  tells  us  that  | Is  a  mixture  of  uniforms, 

ill)  Densities  which  are  mixtures  of  uniforms  coincide  with 
left  continuous  nonincreasing  densities.  The  nonparametrlc  MLE 
in  this  case  is  well  known  (Barlow,  Bartholomew,  Brenner  and  Brunfc, 

1972,  P.  223-228). 

Pact  1:  Hz<»Jt)  is  the  least  concave  majorant  of »  Hn(0»  the  —plrical 

1  2  2  2 
c.d.f.  which  put  mass  —  at  t  ■  y^»y2»,,,»yn* 

Fact  2:  hz(*|t)  is  a  left  continuous  non increasing  step  function  with 

Jumps  at  z  -  yj.yj . y*.  In  fact,  hz(*|+)  is  the  slope 


iv)  An  algorithm  for  computing  the  MLE,  like  tha  "Up-and-Dovn 
Block*"  algorithm  can  ba  found  in  Barlow,  Bartholooav,  Browner  and 
Brunk  (p.  72),  This  algor It hn  la  aaally  prograaaed . 

v)  By  invar lanca,  tha  nonpar  saiat  ric  MLE  of  hj(*|+)  la 

(2.3.1)  hyCyl^)  -  2yh2(y2|+) 

(2.3.2)  Fact  2  +  h^(0|+)  -  2hz(y2|+) 


whore  we  aesuae  y1,y2,...,ya  Is  already  arranged  in  increasing 
order. 

Substituting  (2.3.1)  and  (2.3.2)  and  (2.2.1)  gives 


*  26h»(e2|t) 

1  -  6(6-)  -  —2 - «—  * 

02h^(y2[l) 


(2.3.3) 


1  -  G(0-) 


^(e2!*) 


Fact  2  combined  with  (2.3.3)  tells  us 
Fact  3:  G  is  discrete,  putting  mass 


^(jrjlt)  -  VTjtl) 

A  2.  t  •  •  •  » 

vKi** 

SzO”  -  **<#♦>  *,<#♦> 


at  tha  points  y, , ...,y_ 


i  ,7- 


Th«  nonparamet  r  1c  MLE 


to  provide  an  answer  to  the  objections 


raised  against  the  classical  procedures  (section  2.2).  Objections  1,  11, 
ill,  are  answered  since  we  choose  h^(»|+)  “  h(*|+),  the  nonparsaetrlc 
MLE  which  is  well  defined.  Objection  lv  Is  answered  since  In  order  to 
estlaate  g(8),  we  differentiate  (2.3.3)  which  requires  a  spectral 
derivative  of  order  1  Instead  of  2. 

Objection  v  is  also  answered  since  the  factor  ^  Is  cancelled  in 
(2.3.3). 

2.3c.  Consistency  of  mle. 

From  (2.3.3),  consistency  of  G  follows  from  consistency  of 
hz(<|t)  which  is  proved  In  Barlow,  Bartholomew,  Bremner  and  Brunk 
(P.  228). 

2.4.  Simulation  results. 

The  stationary  distribution 

2 

g(8)  -  -y  exp( - 

c  2c 

is  employed  so  that  h^(*|+)  ■  g(*)»  In  what  follows,  c  *  4.0.  The 
sample  size  n  is  taken  to  be  100.  Thus  *  *  *  *^100  are  8enerate^ 

according  to  hy( • | t) . 

The  nonparametric  MLE  is  computed  using  fact  3  of  section  2.3,  where 
fi^(*| +)  is  computed  using  the  "Up-and-Down  Block"  algorithm.  Averaging 
over  100  trials,  the  result  Is  as  follows. 


TABLE  1 


A 

The  average  of  the  MLE  GO)  and  the  average  percentage  error  over  100 
aaaplea  of  alse  100  dram  from  the  stationary  distribution 

M9>  -  ,(6)  -  &  .-e2'32 
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G(0) 

6(0) 

Z  ERROR 

0.6070 

0.0114 

0.3470 

3025.25 

1.2139 

0.0450 

0.4807 

1015.94 

1.8209 

0.0984 

0.5214 

462.291 

2.4279 

0.1682 

0.5636 

252.551 

3.0349 

0.2501 

0.6039 

153.999 

3.6418 

0.3393 

0.6481 

99.8950 

4.2488 

0.4311 

0.7029 

69.4484 

4.8558 

0.5214 

0.7499 

47.3913 

5.4627 

0.6065 

0.7900 

35.1072 

6.0697 

0.6838 

0.8329 

25.4512 

6.6767 

0.7517 

0.8726 

18.2788 

7.2837 

0.8095 

0.9030 

12.6061 

7.8906 

0.8571 

0.9298 

9.2314 

8.4976 

0.8953 

0.9457 

6.3706 

9.1046 

0.9250 

0.9602 

4.5193 

9.7115 

0.9475 

0.9712 

3.3260 

10.3185 

0.9641 

0.9833 

2.2494 

10.9255 

0.9760 

0.9899 

1.5802 

11.5324 

0.9843 

0.9942 

1.1271 

12.1394 

0.9900 

0.9969 

0.8149 

The  reaults  look  horrible  but  this  has  to  do  with  the  fact  that  the 
MLE  la  badly  "aplked"  at  the  node.  This  peculiar  "peaking"  near  the 
node  of  the  MLE  waa  observed  by  Wegman  (1970a, 1970b) .  He  also  suggested 
a  partial  solution  by  requiring  the  estimate  to  have  a  modal  interval 
of  length  e  ,  where  e  is  some  fixed  positive  number. 

By  pooling  the  first  ten  observations  into  one  block  and  then  applying 
the  "Up-and-Down  Block"  algorithm,  which  is  essentially  a  modified  version 


of  Wegnan's  procedure,  we  are  able  to  obtain  much  improved  estimation 
reaults.  Call  this  estimate  G^.  We  compare  G^  with  various  versions 
of  the  classical  estimator. 


To  describe  the  various  versions  of  the  classical  estimator,  we 
need  the  following: 

From  0  to  12.1394,  the  0.99th  quantile  of  G,  we  fore  a  grid 
of  101  equally  spaced  points  t^  •  0,  t2*****ti00*tl01  “  12.1394.  Let 
Hn  he  the  sample  c.d.f.  of  Hy(*|+)  so  that  k-l,...,101) 

represents  the  values  of  on  that  grid.  Since  we  do  not  have  exact 

data,  that  is,  ve  do  not  have  H(tk>  but  only  HQ(t^)  -  H(t^) +  cfc,  we 
need  spectral  differentiation,  a  numerical  differentiation  procedure  for 
non-exact  data.  Let 


h*(tk) 


t 


dH  (t)  * 

— S - l 

dt  Jt-t 


k 


■  spectral  derivative  of  order  1 
evaluated  at  t  -  for  the  data 

'W- 


Mote:  In  evaluating  the  spectral  derivative,  we  first  taka  away  a  least- 
squares  5th  degree  polynomial  trend  ?(•)  satisfying  the  four  end 
conditions 


P(tl)  -  Hn(tx) 


P(t101>  "  V101) 

w-w 


- 


- 


t2"tl 

V^Ol*  ~  VhoQ* 

tl01 "  t100 


29 


By  r Moving  an  avaraga  Crand  P(t)  about  which  tha  data  tand  to  vary 
in  a  naar  stationary  manner,  tha  rasidual  data  (v.  }  »  {h  (t.)-P(t.)} 

K  Q  K  It 

haa  tha  following  structura  which  is  asaantial  for  spactral  differen¬ 
tiation  to  ba  applicable. 


(2.4.1)  vfc  -  u(tk)  +  ek 

where  u(t),  the  smooth  function,  ia  a  stationary  stochastic  process 
with  continuous  parameter  t  and  sero  expectation  and  ,  the 
measurement  error,  is  defined  by  a  stationary  stochastic  process  with 
discrete  parameter  k  and  zero  expectation. 

This  permits  us  to  apply  spectral  differentiation  to  (vk)  to 
obtain  u'*(tk>,  tha  spectral  derivative  of  u(t) .  Since 


W  -  H<'k>  +  ck 


v 


k 


W  - 


-  HCtk)  -  PCtk)  +  ek  . 


Compare  with  (2.4.1),  we  have 


H(tfc)  -  u(tfc)  +  ?(tk>  • 

So  b*(tfc)  *  u'*(tk)  +  P'(tk),  which  we  call  the  spectral  derivative  of 
H(t) ,  is  an  estimate  of  b(tk)  -  H*(tk). 

A  natural  candidate  for  the  trend  is  a  least-squares  polynomial. 
Tha  first  two  end  conditions  we  imposed  are  natural  and  conditions, 
while  the  last  two  represent  a  numerical  expedient  introduced  to  cope 
with  a  shortcoming  in  spectral  differentiation.  Since  the  relative 
accuracy  of  a  spectral  derivative  deteriorates  only  in  tha  neighborhood 
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of  the  endpoint a,  the  lest  two  and  conditions  ensure  that  the  value  of 
the  derivative  to  be  eat last ad  by  spectral  differentiation  is  small 
compered  with  the  actual  derivative.  (That  is,  u*  ■  H'-P'  is  small 
compared  with  H'  at  the  endpoints).  Since,  we  have  four  end  conditions, 
we  remove  a  least-squares  5®"®  degree  polynomial  so  that  we  have  two  free 
parameters. 

We  are  now  ready  to  describe  the  various  versions  of  the  classical 
estimator,  G. 

Version  1.  G^.  Since  G  is  continuous, 

h_(0|+) 

1  ■  G(e)  ■  e  h^(o|+) 

h*(t.) 

1  “  W  "  t'k  h^(0|  +)  »  k  "  1»*,,»101  • 

That  is,  we  use  the  true  value  of  h^(0[f)  Instead  of  estimating  it.  As 
a  result,  it  would  give  an  overly  optimistic  assessment  of  the  method. 


Version  2.  G2. 


1  -  02(tk)  -  -  h*(t  )  •  k  ■  1 . 101 

A.  /  _  *•  i  \ 


^  t  -  t 

a  c2 


That  is,  h^(Ojt)  is  estimated  by 


h*(t2)  -  h*(t1) 


*2  -tl 
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Version  3.  G^. 


h^(Oj+)  ie  estimated  by 


h*(t6)  -h*(tx) 


Version  4.  G^.  h^(0|+)  ■  H^(0|+)  Is  estimated  as  follows: 

-  spectral  derivative  of  order  2 
evaluated  at  t  ■  t^  for  the  data 

{W>  • 


Let 


d\  (t)  * 
r - 2 - ] 

1  dt2  Jt-fck 


d2H  (t)  *  „  d2H  (t)  * 

Define  1  -  mln(k:  [ - = — ]  >  0>  .  Then  h^(0[+)  “  t - 5 — ] • 

dtZ  Z  Ck  Y  dtZ  C  Ci 

The  same  5th  degree  polynomial  trend  described  earlier  is  first 
removed  before  we  compute  the  derivative. 

Version  5.  Gj.  Same  as  version  4  accept  that  we  first  take  away  a  least 
squares  7**1  degree  polynomial  trend  P( ')  satisfying  the  restrictions 


p<ti>  -  w 


P(t101>  "  Hn(t101> 


P’O^) 

P”(t1) 


t2- tL 

h*(t6)  -h*(t1) 


P’(t101> 


jn^lOl^  ~Hn(t100) 
t101 ~ t100 


P"(t  ) 
r  vcior 


h*<Wh*(t96) 
C101 "  *96 


where  we  recall  h*  is  the  spectral  derivative  of  order  1. 

Version  6.  Gg.  This  is  a  combination  of  version  3  and  version  4.  For 


ill  values  of  k,  say  k  -  6,  G, (tfi)  can  sometimes  be  negative, 


corresponding  to  those  cases  where 
Is  negative,  estimate  h^(o|+)  by 


hJ<o|t) 

h*(t6) 


is  small.  Thus  when  G^(tg) 
instead,  forcing  Gg(tg)  -  0. 


A 

A  comparison  of  with  G^-Gg  is  given  In  Table  2. 


TABLE  2 

A 

A  comparison  of  the  modified  MLE  G^(6)  with  six  versions  of  the  classical 
estimate  (Gj(0),  i“l, . . . , 6)  in  terms  of  the  average  of  the  estimates  and 
their  average  percentage  error  (shown  on  even  rows)  over  100  samples  of 
size  100. 


/\ 


e 

G(6) 

c1cn 

1.  214 

0.  045 

0.  014 
126.  22 

2.  420 

0.  168 

0.  135 
96.  291 

3.  642 

0.  339 

0.  356 
44.  985 

4.  836 

0.  321 

0.  543 
24.  813 

6.  070 

0.  684 

0.  701 
14.  602 

7.  204 

0.  810 

0.  323 
6.  322 

8.  498 

0.  895 

0.  899 
4  012 

9.  712 

0.  948 

0.  948 
2.  657 

10.  926 

0.  976 

0.  981 
1.  361 

12.  139 

0.  990 

0.  994 
0.  794 

Gjfl) 

G2(9) 

G3(8) 

-0.  024 

0.  166 

0.  200 

337.  79 

438.  92 

458.  46 

0.  192 

0.  291 

0.  342 

44.  720 

159.  95 

145.  88 

0.  396 

0.  433 

0.  486 

17.  923 

79.  617 

69.  311 

0.  336 

0.  566 

0.  615 

7.  686 

41.  325 

34.  796 

0.  678 

0.  676 

0.  716 

4.  248 

23.  312 

18.  211 

0.  793 

0.  789 

0.  818 

3.  119 

13.  045 

9.  930 

0.  885 

0.  883 

0.  898 

1.  959 

6.  432 

4.  853 

0.  932 

0.  953 

0.  959 

1.  740 

2.  972 

2.  556 

0.  981 

0.  987 

0.  986 

1.  237 

1.  366 

1.  337 

0.  997 

0.  996 

0.  996 

1.  733 

1.  649 

0.  630 

g4(0) 

Gs(0) 

g6(0) 

-2.  332 

0.  149 

0.  247 

5831.  0 

439.  06 

495.  22 

-1.  238 

0.  310 

0.  379 

1055.  1 

131.  31 

156.  07 

-0/665 

0.  464 

0.  515 

385.  96 

63.  998 

72.  812 

-0.  381 

0.  597 

0.  635 

215.  05 

32.  154 

35.  814 

-0.  108 

0.  701 

0.  731 

134.  08 

17.  553 

18.  274 

0.  250 

0.  307 

0.  827 

77.  735 

9.  811 

9.  737 

0.  554 

0.  892 

0.  904 

42.  178 

4.  889 

4.  750 

0.  J29 

0.  957 

0.  961 

15.  765 

2.  490 

2.  549 

0.  948 

0.  985 

0.  987 

5.  403 

1.  321 

1.  325 

1.  015 

0.  997 

0.  996 

3.  573 

1.  677 

1.  613 

'  m 


a 
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Conclusion .  Except  foe  vhlch  we  cannot  use  in  practice  since 
h^(0|+)  Is  unknown,  the  nonparametric  MLE  6  performs  better  than 
the  classical  estimators  in  terms  of  both  bias  and  percentage  error. 

Another  advantage  of  the  MLE  is  that  it  actually  gives  an  estimate 
G  of  G  whereas  the  classical  procedures  estimate  G  only  on  an  even 
grid. 


2.5.  Truncation. 


Suppose  y  is  observed  only  if  y  y^.  Then  if  0  <  y^,  we  can 
never  observe  y.  Whereas  if  we  have  a  sphere  Q  of  radius  0  >_  y^ 
with  center  at  C,  then  we  observe  y  >.  y^  if  and  only  if  the  linear 
probe  L  hits  a  sphere  Q'  of  radius  0*  «/e2-yg  also  centered  at  C. 
The  intersection  of  L  with  Q'  is  a  line  segment,  let  y*  be  half 
the  length  of  that  segment.  The  following  diagram  shows  what  is  going  on. 


0  -yQ  given  that  0  >_  yQ> 


we  will  be  back  to  the  case  with  no  truncation. 
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CHAPTER  III 


THE  RANDOM  PLANE  CASE 
3.1.  The  basic  formulas . 


(3.1.1) 


h(y|+) 


r 

*  V 


2  2 
-y 


dG(8|+) 


(3.1.2)  dG(6}+)  -  9 

yG 


where  vG  -  /0dG(0).  So 


(3.1.3) 


Again,  (3J.3)  can  be  inverted  to  give 


1  -  G(9>  -  ^  f  S&lii  dy  . 

Setting  0*0,  we  have 

(3.1. A)  1  -  — 5  f  i  h(y|  +)d/  . 

*  J  0  y 


(3.1.5) 


1  -  G(0) 


,°3 


If  G  has  density  g,  then 


Since  y,,y  ,...,y  i.i.d.  'v  h(y|+),  one  can  replaced  h(y]+)dy 
12  n 

In  (3.1.5)  by  dHn(y)  where  Is  the  sample  c.d.f.  This  is  known 

as  back  substitution,  we  have 

1  -  Gn(9)  . 


1-Gn  is  unsatisfactory  because  1-G  is  decreasing  but  1-G^  is  increasing 

in  each  of  the  Intervals  ty_  ,,y_  where  y  ,<  y  «<•••<  y 

a  f  x  Qy  i » x  u  |  x  n,  xi 

are  the  ordered  y's.  Moreover,  1  -  Gr  has  singularities,  it  goes  to 
infinity  as  0  +  y^,  i  ■  l,...,n. 

Anderssen  and  Jakeoan  (1975)  explained  the  poor  performance  of  back 
substitution  by  its  use  of  a  simple  rectangular  quadrature  approximation 


rather  than  a  more  appropriate  approximation  which  coped  with  the  singularity 


of  the  Integrand 
Integration.  H 


— - at  y  -  0. 

was  smoothed  to  H 


They  proposed  a  method  based  on  product 
using  a  localized  Lagrange  inter¬ 


polation  and  h(y|t)dy  in  (3.1.5)  was  replaced  by  dHQ(y) .  They  daaon- 


strated  some  desirable  properties  for  this  class  of  estimators. 
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Hi  can  estimate  G  by  first  estimating  H(*|+)  and  then  using  the 
inversion  formula  (3.1.5).  However,  Ch( *  |  +) )•  is  a  smaller  claes  than 
the  claas  of  all  distribution  functions  and  we  have  to  take  that  into 
consideration  when  we  estimate  H(*[+).  Thus  it  is  Important  to  have 
a  characterization  of  H(*|t). 

Since  H(y|8,-f)  has  density 


h(y|9,+)  -  -  ■  0  <  y  <  6 

H(y|+)  has  density 


dA2-y2 


r 


h(y|  +)  »  dG(0|+) 


h(y[t)  satisfies  the  following  conditions  : 


d) 


f  h(y|t)  dy  -  1  ,  h(y[+)  0 

i  0 


c2) 


h(y| +)  is  right  continuous 


c3) 


r  ^  b(y|+)  dy 
Jn  * 


< 


c4) 


r 


dy  is  nonincreasing 


Dot*:  cl  and  c2  are  obvious.  c3  follows  from  (3.1.4)  and  c4  la 
a  consequence  of  (3.1.5). 

On  the  other  hand,  if  h*  satisfies  cl-c4,  let  G*  be  defined  by 


C 


i  -  c*(e>  - 


C  y  h*<y>dy 


c2-c4  Imply  that  g*  is  a  distribution  function  with  G*(0-) 
and 


-  0 


J  0  dG*(0)  -  [  1-G*(0)d0 

J 0  J0 


rr 

Jo  J 0 


dy  d0 


h*(y)dy 


f  - 

Jo  7 

f  h*<y)  F  —L—  d0  dy 

J°  J°  /7^? 

I  7  i»*(y)«*y 
J  o  7 


TT 

2 


|  “  h*(y)dy 

Jo  7 


<  ®  . 


By  reversing  the  steps  in  proving  (3.1.5)  (see  Coleman,  1979,  F.53),  we  can 


h*(y)  -  |  ~  1 . dG*(8 1 1) 

‘y  e/eV 

where 


dG*(0j+)  -  ■  ■ 

[  0  dG*(0) 

Jo 


Therefore  cl-c4  is  a  characterisation  of  (h(*|t)}. 

Anderssen  and  Jakeman  explained  the  poor  performance  of  back  substitu¬ 
tion  by  the  Inability  of  the  sample  c.d.f.  to  cope  with  the  singularity 

of  the  Integrand  l//y2-02  at  y  *  0.  However,  a  more  basic  problem  is 
that  Hn  t  {H(* | +)  },  to  see  this,  note  that  Hr  is  discrete  while  H(*|f) 

has  density  h(*|i).  In  this  light,  even  though  Anderssen  and  Jakeman' s 

•>* 

product  Integration  estimate  Hn  copes  with  the  singularity,  there  is  no 
reason  to  believe  that  it  will  solve  the  more  basic  problem  that  Hq  may 
not  belong  to  the  admissible  range  {H(*|t)}.  In  the  appendix,  we  consider 
the  simplest  kind  of  product  integration,  namely,  piecewise  linear  inter¬ 
polation.  We  show  that  there  exists  G  such  that  P(Hn  violates  c4)>P>0 
for  n  sufficiently  large  and  some  P  >  0.  (See  A.l  of  Appendix). 

Note:  Saying  that  Hn  violated  c4  is  the  same  as  saying  that  Gq  is 
not  nondecreasing,  that  is,  Gr  Is  not  a  distribution  function. 

3.4.  Nonpar ametric  MLE. 

3. 4.  a.  Derivation. 

The  following  argument  shows  that  the  MLE  is  not  well  defined  as 


it  stands. 


Recall  the  identification  of  section  1.6, 

Q(6)  -  G(6j+) 

fA(y)  -  h(y|0,+)  ■ - y—  0  <  y  <  0 

e/e2-,2 

fq(y)  *  h(y|t)  -  |  fQ(y)  dQ(y)  . 


n 

The  problem  is  then  equivalent  to  maximizing  II  f .  subject  to 

i-1  1 

(f^, ...,fn)  e  eonv(D  where  conv(r)  -  convex  hull  of  T.  Because  of 

the  singularity  of  f Q(y)  at  8  «  y#  I*  is  not  bounded,  so  conv(D 

n 

is  not  bounded  and  we  cannot  maximize  II  f  . 

i-1  1 

Using  the  following  limiting  process,  we  can  get  around  the 


difficulty  of  the  singularity.  Let  e  ■  . eQ),  >  0.  Let 

~  n 

Q(e)  maximize  II  frt(y.)  but  only  among  those  Q's  which  do  not  put  mass 
i-1  q  1 


on  the  intervals  +  1  -  l,...,n.  By  doing  so,  we  avoid  the 

problem  of  singularity  and  f  is  now  bounded.  Using  the  fact  that  as  a 


function  of  0,  f0(y)  ■  0  for  0  <_  y  and  f0(y)  is  decreasing  for 
0  >  y  it  can  be  shown  that  G(e)  must  necessarily  put  all  its  mass  at 


{y1+€1»y2+e2** * •»yn+en^  with  corresponding  masses  P1(e),P2(e) . Pn(e). 

Now  If  (P, (e),...,P  (e))  -*■  (P, , .  • .  ,P  )  as  e  0  it  would  seem  reasonable 
i  -  n  —  x  n  ~ 

A 

to  define  the  MLE  Q  by  the  discrete  distribution  which  puts  mass  P^  at 
y±.  i  ■  l,...,n. 

Claim:  P.  (e)  —  as  e  -*■  0  for  all  1. 

1  n  — 

Proof:  Let  f^Ce)  “  fy  ^  (y^)  »  we  want  to  find  P1»P2,...,Pn  which 

maximize  H.11.  (E.n,P.f.,)  subject  to  P.  >  0,  P.,  +  ***+P  ■  1. 

j-1  '  i-1  1  ij  1  —  1  n 

Notice  that  f^ (e)  -  0  for  j  >  1.  Using  the  method  of  Lagrange 

multipliers,  it  reduces  to  maximizing 

10*<pifn  +  Vn  +-+  Pnfol> 

+  1°*<P2£22  +  P3f32  +"'+  Vn2> 

e 

+ 

-  a  (P,  +. . .+  p  - 1)  . 
x  n 


Taking  partial  derivative  and  setting  them  to  zero  give 


In  the  above  equations,  e  Is  suppressed.  Since 


fe<y>  - 


_ i  . 

0/e2-y2 


0  <  y  <  e 


otherwise 


and 


'« ■  V’i>  • 


we  see  that  as 


e  -►  0  f ..  “ 


{fjj,  1  >  j)  Is  uniformly  bounded. 


Thus  as  e  ■*  0,  the  equations  become 


—  -  a  -  0 
P 

n 

P«  +  . . .  +  P  -  1 
x  n 


from  which  <P.(e),  P_(e),...,P (e)) 

x-v  z  w  n  -  tin  n 

A  1 

Therefore  the  MLB  Q  puts  mess  —  at  y^,  1  < 


easily  follows. 
1, . . . , n. 
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E 


3.4b.  Inconaistencv  of  MLE. 


LeC 


It  suffices  to  show  that  FA  is  an  inconsistent  estlaate  of  FQ. 

Q  V 


.  n 

Fn(y)  -  ~  I  I{y  <y}  the  sample  c.d.f. 


By  the  Glivenko-Cantelll  theorc 


sup  |f  (y)  -  Fo(y)|-**0  with  probability  1. 

y  ” 


Suppose  F*  is  consistent  in  the  sense  that  F*  -►  F_  with  probability  1. 

Q  Q 

Since  Fq  is  continuous 

sup  |  F~(y)  -  F_(y)  [  -*■  0  with  probability  1 

y 


and  so 


sup  |  F^ (y)  -  F_(y)  1  -*■  0  with  probablity  1. 

Q  n 


But 


F-(y)  -  I  £  Fy  (y) 
Q  i-i  n  yi 

Since  FA(y)  -  1  if  8  <  y 


Fq<y>  •  +  n  ji  Fyi<y)  1{yl>y} 

'  F»<F)  + "  l  VF) 
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Therefore 


Vy)  -  v*)  ■  i  j,  Vy)  Vy,‘ 

By  the  strong  law  of  large  numbers 

F^(y)  -  Fn(y)  -►  E<FY(y)  I{y >  y}>  >  0  where  Y  *  FQ  . 

This  Is  a  contradiction,  so  Q  Is  not  a  consistent  estimate  of  Q. 
Note:  It  Is  remarkable  that  the  restricted  MLE  FA  Is  Inconsistent 

Q 

whereas  the  unrestricted  MLE,  the  sample  c.d.f .  Fq  is  consistent. 
Other  instances  of  this  phenomenon  can  be  found  In  the  literature. 

For  example,  the  restricted  MLE  for  starshaped  and  star-ordered  classes 
of  distribution  Is  inconsistent.  (See  Barlow,  Bartholomew,  Bremner 
and  Brunk,  P.  257-258) . 

3.5.  Minimum  distance  methods. 

In  this  section,  we  will  make  frequent  use  of  the  Identification 

fQ(y)  <-*  h(y|8,+) 

Q<8)  <->  G(8|*) 

|  Fe(y)dQ(8)  -  FQ(y)<-*  H(y|+)  -  j  H(y|8,t)dG(6|t) 


fQ(y)<“*  h(y|+) 


3.5a.  Description. 


i)  Deely  and  Kruse's  approach. 


YltY2,...,Yn  i.i.d.  *  Fq 


where  Qq  •  GQ( *| +)  and  Gq  is  the  true  particle  size  distribution. 

After  observing  the  data  yi»Y2,‘**,^n  *et  ^n  t*ie  8et 
discrete  distributions  with  weights  at  *  *  * ’^n*  %i  m^n^n^zes 

|| Fq— Fn(|  among  all  Q  e  (Jn>  where  Fr  is  the  sample  c.d.f.  This  is 
a  linear  programming  problem. 


ii)  Choi  and  Bulgren's  approach. 

Qh  minimizes  j  (FQ(y)-Fn(y))2  dFn(y) 
among  all  Q  e  Qr.  This  is  a  quadratic  programming  problem. 


Note:  In  Deely  and  Kruse  (1968)  {©n  1§0n  2’**’,0n  n^  are  ie<** 

In  Choi  and  Bulgren  (1968),  {0  . . 0  }  are  part  of  the  argument  of 

n,x  n,n 

the  function  we  seek  to  minimize.  But  here  (0  ,,....0  }  ■  {y, ,...,y  } 

n,i  n,n  x  n 

equals  the  observed  data. 


3.5b.  Consistency  of  Gn(0|+). 

We  need  the  following  lemma. 
Lemma  3.5.1 .  Let 


^1*^2’ *  *  *  *^n' *  *  * 


i.i.d.  'v  h. 


,(yi+)  -  c 

*  V 


— X_dG  (0|+) 


For  almost  every  sample  sequence  (y.  3  there  exists  a  Q*  e  Q  for  each 

ic  n  n 

n  such  that 


0 

QJ->qo 


and 
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Proof.  See  A. 2  of  the  Appendix 

1)  Consistency  of  Deely  and  Kruse's  estimator.  With  the  aid  of 

Lemma  3.5.1,  the  same  proof  In  their  paper  goes  through. 

11)  Consistency  of  Choi  and  Bulgren's  estimator.  By  Lemma  3.5.1, 

for  almost  every  sample  sequence  {y^},  there  exists  Q*  e  Qn  such 

||  -*■  0  (since  F  is  continuous) 

0  Q0 

•=>  | f F _ .  —  F  ||  -*■  0  where  F  is  the  sample  c.d.f. 

11  Q*  n"  n 

| <Fq*<y>  "aw 

1  “VF""2  *°- 

By  definition 


that  11  Fo*  "  F< 


<F~  (y)  -Fn(y))2 


dFn(y)  < 


(F^y)  -Fn(y))2dFn(y)  +  0  . 


By  the  lemma  on  P.  453,  Choi  and  Bulgren  (1968) 


tflFft  "F  ||  -  V  <  3  (  (Fa  (y)  ~  F  (y))2dF  (y)  -  0 
Qn  “  n  J  Qn  n 

->  ||  F/v  -F  II  -*■  0 
Qn  n» 

II  Fa  -F  ||  -  0  . 

Qn  Q0 

By  Theorem  2,  Robbins  (1964) 


% 
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3.5c.  A  difficulty  in  proving  consistency  of  G^. 


We  proved  that  0^(0  ■  Gn(*|+)  is  consistent  in  the  sense  that 
with  probability  1,  Gn(*K)  — >  (3.1.2) 


1  -  Go(0-) 


C  Ws|t) 

r  |do0(ci+) 


So  1  -  G  (0-) 
n 


Since  -r  is  continuous  and  bounded  on  [0,00) 


O^c-lt)  G0C •  i  +)  -*  j  idGn(5|+) 


•j“  id00C5|+) 


Thus  the  numerator  of  1-G  converges  to  the  numerator  of  1-Gq.  But 
4  has  a  singularity  at  5*0  so  that  Gn(»|+)  — ►  Gq(»|+)  does  not 


necessarily  imply 


f0  ?*„«•«  • 


Thus  the  denominator  may  not  converge  to  the  right  value.  Therefore 

A  if  £• 

G  (»!+)  — ►  Gg(*|t)  does  not  necessarily  imply  G  — ►  GQ. 


3. 5d. Modification  leading  to  consistent  estimate  of  Gq. 

In  the  previous  section,  we  see  that  the  problem  is  that  we  cannot 
Insure 


(3.5.1)  ^  |  deceit)  +  ^  |dG0(?l+) 


If  (3.5.1)  can  be  insured,  then  G  (•!+)  G_(*|f)  G  —*■  Gn.  This 

n  u  1  n  u 

motivates  the  following  procedures 
i)  Modified  Deely  and  Kruse 


minimize 


ii  -  f  ii + 1  r  7  dQ(?>  -  -  - 1  — i 

"  Q  n11  ‘j0  5  IT  n  i£i  y±l 


among  all  Q  e  Q^.  Again,  this  is  a  linear  programming  problem, 
ii)  Modified  Choi  and  Bulgren. 

minimizes  |(FQ(y)-Fn(y))2dFn(y)  +  (j"  |-dQ(0-|-^  l  “) 

This  is  a  quadratic  programming  problem. 

To  prove  consistency  results,  we  need  the  following: 

Fact  3.5.1. 

\  ^  y^-  dQ0(5)  with  probability  1  . 

Proof.  By  (3.1.2)  and  (3.1.4) 

C  W°-C  eV!"-;  £ 

so  that  fact  3.5.1  follows  from  the  strong  law  of  large  numbers. 
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C  T  *  f0  I  dV» 


by  fact  3.5.1.  in  other  words,  (3.5.1)  Is  satisfied.  Together,  these 

/s  t' 

imply  Gr  Gq  with  probability  1. 

The  proof  of  the  consistency  of  the  modified  Choi  and  Bulgren's 
procedure  is  analogous  and  will  be  omitted. 


3.5e.  Comparison  with  existing  methods. 


1)  In  the  next  section  we  will  present  simulation  results  which  are 

indicative  of  the  better  performance  of  the  minimum  distance  method. 

ii)  The  minimum  distance  method  actually  gives  an  estimate  Gq  of 

G-.  The  back  substitution  estimate  G  is  not  even  a  d.f.  There  is 
o  n 

no  reason  to  believe  that  the  product  integration  estimate  Gn  is  a 
d.f.  either. 


ill)  An  equivalent  way  to  express  ii)  is  that  the  minimum  distance  method 
takes  into  account  the  structure  of  the  problem  so  that  F  ■  F*  e  { F_ } 

n  Qn  Q 

whereas  F&  and  FQ  t  iF^}.  The  necessity  to  have  an  estimate  belong 
to  (Fq)  is  magnified  when  we  come  to  the  section  on  the  bootstrap 
(section  3.7). 


3.6.  Simulation  results. 

Case  1.  Qq(6)  ■  Gq(0|+)  puts  weight  0.2  at  0  -  1,2, 3, 4, 5.  Sample 

size  n  ■  100. 

Note:  i)  We  assume  that  the  support  points  1,2, 3, 4, 5  are  known  so  that 
we  have  a  four  parameter  problem  instead  of  an  infinite  parameter  problem 
as  in  the  fully  nonparaaetric  case. 
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ii)  In  this  csss  of  a  finite  mixture  with  known  support ,  the  MLE 
will  be  consistent. 

ill)  The  minimum  distance  method  is  Implemented  without  the  modification 
described  in  section  3.5d. 

lv)  In  computing  Choi  and  Bulgren's  estimate,  we  do  not  need  to 
solve  a  quadratic  programming  problem.  Without  the  nonnegativity  constraint 
the  solution  is  just  the  solution  of  a  linear  system  of  5-1  -  4  equations. 

The  solution  to  the  linear  system  will  usually  satisfy  the  nonnegativity 
constraint  automatically  because  we  expect  the  estimate  to  be  near  the 
true  value  of  0.2  as  the  sample  size  increases, 
v)  Let 

(3.6.1)  y  «  [  0  dG(0)  . 

>0 

By  (3.1.4) 

(3.6.2)  W-f-rS”1 -  . 

f  ~  h(y|  +)  dy 
J 0  y 

MLE  and  minimum  distance  method  estimate  y  by  replacing  dG(6)  by 

dG  (8)  in  (3.6.1).  Back  substitution  and  product  integration  estimate 
n 

y  by  replacing  h(y|+)  dy  in  (3.6.2)  by  dHQ(y)  and  dHQ(y)  respectively. 

The  results  are  summarized  in  tables  3,4,5. 

Case  2.  Gq(0)  puts  mass  0.2  at  0  ■  1,2, 3, 4, 5.  Sample  size  n  ■  100. 

The  results  are  summarized  in  tables  6,7,8. 

Note:  In  cases  1  and  2,  we  have  cheated  a  little  bit  for  we  assume  we  know  the 

support  points  0  ■  1,2, 3, 4,5.  When  they  are  not  known,  the  nonpar ametrlc 
MLE  is  Inconsistent  (section  3.4).  However,  the  minimum  distance  estimate 
is  still  consistent  (section  3.5). 
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In  spite  of  this,  we  cen  still  ssy  the  following: 

1)  The  s Initiation  results  are  still  Indicative  of  better  perfornance 

by  the  minimum  distance  method  over  existing  methods  since  II  takes  Into 

account  the  structure  of  the  problem.  This  Is  confirmed  In  case  3. 

11)  For  the  thin  slice  ease,  If  GQ  is  discrete  with  finite  support, 

we  can  in  fact  know  the  points  of  support  from  the  data  since  at  those 

values,  we  should  have  repeated  measurements  with  high  probability  for 

large  samples,  (see  sections  4. 5,4. 8). 

0  02 

Case  3.  g(0)  -  —5-  exp( - r)  ,  6  >  0  , 

c  2c 

that  is,  the  stationary  distribution,  c  ■  4,  n  ■  100. 

Note:  1)  This  is  a  "true"  test  of  the  minimum  distance  method  a  lace  we 
compute  it  "honestly"  here  without  the  Incorporation  of  additional 
information  concerning  G. 

ii)  We  use  Deely  and  Kruse's  approach, 
ill)  After  averaging  over  100  trials,  Anderssen  and  Jakasun's  estimate 
Gn  seems  to  be  increasing.  But  In  a  substantial  number  of  the  100  trials, 
Gn  Is  not  increasing.  They  are  not  reproduced  here  for  lack  of  space. 

The  results  summarized  In  tables  9  and  10  are  still  in  favor  of  the 
minimum  distance  method. 
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6(8 | +)  puts  mass  0.2  at  8  -  112,3,4,5 
Sample  size  «*  100 
planar  data 

TABLE  3 

The  average  over  100  trials  of  the  Choi  and  Bulgren  estimate  and  the 
maximum  likelihood  estimate  of  the  probability  mass  functions  P(8) 
and  P(8|+)  and  their  m.s.e.  (shown  on  even  rows). 


8 

P(8) 

C&B 

MLE 

P(0|+) 

C&B 

MLE 

1.0000 

0.4380 

0.4283 

0.4186 

0.2000 

0.1977 

0.1906 

0.0075 

0.0059 

0.0027 

0.0021 

2.0000 

0.2190 

0.2215 

0.2264 

0.2000 

0.2008 

0.2032 

0.0058 

0.0043 

0.0046 

0.0034 

3.0000 

0.1460 

0.1499 

0.1505 

0.2000 

0.2026 

0.2020 

0.0029 

0.0023 

0.0048 

0.0037 

4.0000 

0.1095 

0.1152 

0.1147 

0.2000 

0.2083 

0.2049 

0.0017 

0.0011 

0.0051 

0.0031 

5.0000 

0.0876 

0.0851 

0.0898 

0.2000 

0.1907 

0.1993 

0.0010 

0.0007 

0.0041 

0.0027 

« 


e 
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TABLE  4 


A  comparison  between  Anderssen  and  Jakeman's  estimate,  Choi  and  Bulgren's 
estimate  and  the  maximum  likelihood  estimate  of  6(6)  in  terms  of  m.s.e. 
(shown  on  even  rows)  and  the  average  of  the  estimates  over  100  trials. 


0 

G(0) 

A&J 

C&B 

MLE 

0.5000 

0.0000 

-0.0658 

0.0000 

0.0000 

1.0000 

0.4380 

0.3100 

0.4283 

0.4186 

32.1618 

15.5362 

14.1427 

1.5000 

0.4380 

0.3738 

0.4283 

0.4186 

24.3492 

15.5362 

14.1427 

2.0000 

0.6569 

0.5802 

0.6498 

0.6450 

14.7763 

7.9217 

6.8735 

2.5000 

0.6569 

0.6211 

0.6498 

0.6450 

10.5440 

7.9217 

6.8735 

3.0000 

0.8029 

0.7382 

0.7997 

0.7956 

9.7252 

4.4107 

4.2060 

3.5000 

0.8029 

0.7872 

0.7997 

0.7956 

5.5502 

4.4107 

4.2060 

4.0000 

0.9124 

0.8877 

0.9149 

0.9102 

4.0513 

2.7474 

2.3778 

4.5000 

0.9124 

0.9104 

0.9149 

0.9102 

3.1911 

2.7474 

2.3778 

5.0000 

1.0000 

1.0000 

1.0000 

1.0000 

0.0000 

0.0000 

0.0000 

TABLE  5 

A  comparison  of  the  back  substitution  estimate,  Anderssen  and  Jakeman's 
estimate,  Choi  and  Bulgren's  estimate  and  the  maximum  likelihood  estimate 
of  the  mean  sphere  radius  y  in  terms  of  m.s.e.  (shown  on  second  row)  and 


the  average  of  the  estimates  over  100  trials. 


y 

2.1896 


B.S. 

2.2850 

0.0771 


A  &  J 

2.3965 

0.1067 


C  &  B 

2.2073 

0.0302 


MLE 

2.2305 

0.0271 


Note:  B.S.  ■  Back  substitution,  A&J  •  Anderssen  &  Jakeman,  C&B ■  Choi  and  Bulgren 


G(8)  puts  mass  0.2  at  0  -  1,2. 3. 4. 5 
Sample  size  •  100 
Planar  data 

TABLE  6 

The  average  over  100  trials  of  the  Choi  and  Bulgren  estimate  and  the 
maximum  likelihood  estimate  of  the  probability  mass  functions  P(6) 
and  P(0|+)  and  their  m.s.e.  (shown  on  even  rows). 


0 

P<0) 

C&B 

MLE 

P(0|+> 

C&B 

MLE 

1.0000 

0.2000 

0.2262 

0.2161 

0.0667 

0.0810 

0.0751 

0.0145 

0.0080 

0.0024 

0.0013 

2.0000 

0.2000 

0.1930 

0.1927 

0.1333 

0.1316 

0.1303 

0.0096 

0.0052 

0.0042 

0.0023 

3.0000 

0.2000 

0.2017 

0.1992 

0.2000 

0.2063 

0.2013 

0.0073 

0.0060 

0.0072 

0.0058 

4.0000 

0.2000 

0.1865 

0.1916 

0.2667 

0.2540 

0.2576 

0.0039 

0.0031 

0.0059 

0.0048 

5.0000 

0.2000 

0.1926 

0.2003 

0.3333 

0.3271 

0.3357 

0.0023 

0.0022 

0.0043 

0.0041 

TABLE  7 


A  comparison  between  Anderssen  and  Jakeman’s  estimate,  Choi  and  Bulgren's 
estimate  and  the  maximum  likelihood  estimate  o£  G(6)  in  terms  of  m.s.e. 
(shown  on  even  rows)  and  the  average  of  the  estimates  over  100  trials. 


6 

G(8) 

A&J 

C&B 

MLE 

0.5000 

0.0000 

-0.0613 

0.0000 

0.0000 

1.0000 

0.3000 

0.1269 

0.2262 

0.2161 

59.1435 

47.2917 

34.6940 

1.5000 

0.2000 

0.1655 

0.2262 

0.2161 

64.7058 

47.2917 

34.6940 

2.0000 

0.4000 

0.3004 

0.4192 

0.4089 

37.2330 

18.1612 

15.1627 

2.5000 

0.4000 

0.3764 

0.4192 

0.4089 

25.3233 

18.1612 

15.1627 

3.0000 

0.6000 

0.5397 

0.6210 

0.6081 

16.9577 

9.5857 

8.8996 

3.5000 

0.6000 

0.5915 

0.6210 

0.6081 

11.6811 

9.5857 

8.8996 

4.0000 

0.8000 

0.7670 

0.8074 

0.7997 

7.6166 

4.7007 

4.5665 

4.5000 

0.8000 

0.7859 

0.8074 

0.7997 

7.0196 

4.7007 

4.5665 

5.0000 

1.0000 

1.0000 

1.0000 

1.0000 

0.0000 

0.0000 

0.0000 

TABLE  8 

A  comparison  of  the  back  substitution  estimate,  Anderssen  and  Jakeman's 
estimate,  Choi  and  Bulgren's  estimate  and  the  maximum  likelihood  estimate 
of  the  mean  sphere  radius  |i  in  terms  of  m.s.e.  (shown  on  second  row)  and 
the  average  of  the  estimates  over  100  trials. 


»  B.S.  A&J  C&B  MLE 

2.9262  2.9673 

0.0751  0.0489 


3.0000  2.9403 

0.2022 


3.1430 

0.1369 


2 

g(8)  -  r|  e-6  ^32,  the  stationary  distribution 
Sample  sire  »  100 
Planar  data 

TABLE  9 

A  comparison  between  And era sen  and  Jakeman's  estimate  and  Deely  and 
Kruse's  estimate  of  G(0)  in  terms  of  m.s.e.  (shown  on  even  rows)  and 
the  average  of  the  estimates  over  100  trials. 


0  G(0) 

1.2812  0.0500 

1.8362  0.1000 

2.2805  0.1500 

2.6722  0.2000 

3.0341  0.2500 

3.3784  0.3000 

3.7128  0.3500 

4.0431  0.4000 

4.3739  0.4500 

4.7096  0.5000 

5.0549  0.5500 

5.4149  0.6000 

5.7961  0.6500 

6.2070  0.7000 

6.6604  0.7500 

7.1765  0.8000 

7.7915  0.8500 

8.5839  0.9000 

9.7910  0.9500 


A&J  D&K 

0.0209  0.0978 

278.295  163.940 

0.0451  0.1230 

138.269  95.6568 

0.1088  0.1635 

71.2289  53.2543 

0.1344  0.2089 

59.7322  45.6907 

0.2296  0.2599 

48.6981  39.2419 

0.2596  0.3139 

35.3034  28.1828 

0.3352  0.3589 

23.2890  22.2477 

0.3663  0.3933 

30.8967  23.5758 

0.3775  0.4267 

34.1915  24.7209 

0.4472  0.4903 

26.0370  22.0270 

0.4884  0.5429 

31.3737  20.9288 

0.5622  0.5871 

16.8953  11.8931 

0.6039  0.6239 

13.9161  11.6400 

0.6828  0.6906 

10.5103  10.5589 

0.6928  0.7345 

15.0637  9.2468 

0.7783  0.7847 

7.7646  5.9378 

0.8226  0.8244 

7.3098  6.3145 

0.8986  0.8964 

2.9213  2.7481 

0.9413  0.9379 

3.1622  2.7778 
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TABLE  10 


A  comparison  of  the  back  substitution  estimate,  Anders sen  and  Jakeaan'a 
estimate  and  Deely  and  Druse's  estimate  of  the  mean  sphere  radlue  y 
In  terms  of  m.s.e.  (shown  on  second  row)  and  the  average  of  the  estimates 
over  100  trials. 

W  B.S.  A&J  DAK 

5.0133  5.0382  5.3666  5.0068 

1.0194  0.8847  0.4658 

Note:  DAK  *  Deely  and  Kruse. 

3.7.  Bootstrap. 

3.7a.  Why  la  It  necessary? 

The  distributions  of  atereological  estimators  are  notoriously  hard 
to  find.  Take  the  simplest  case  of 


«  XI  m 

i  J  i  , 

n  £  Y  * 
n  l-i  *i 


since  E(ry)  -  °°,  T  has  Infinite  variance.  Watson  (1971)  showed  that  T 
Yz  **  n 

Is  asymptotically  normal,  but  the  approximation  is  not  usable  for  two 
reasons:  1)  the  normalizing  factor  and  asymptotic  variance  depends  on 
whether 


|  dG(0)  <  « 


or  not. 

11)  Monte  Carlo  experiments  showed  that  the  normal  approximation  was 
very  poor  when  n  -  100. 
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a 


m 


Bloomfield  showed  that  by  simply  omitting  the  first  term  of  the 

classical  estimator,  the  resulting  estimator  ^  y~^ —  k®8 

n,  1 

variance  even  though  it  Is  biased. 

Jakeman  and  Scheaffer  (1978)  found  conditions  under  which 


1 

n 


have  the  same  asymptotic  distribution.  Since  Anderssen  and  Jakeman’s 
trapezoidal  product  integration  estimate  is  sandwiched  in  between  the  above 
two  estimates,  it  also  has  the  same  asymptotic  distribution. 

Jakeman  and  Schaeffer  also  simulated  SO  independent  samples  of  size 
150,  300,  and  2000.  They  compare  the  theoretical  asymptotic  variance 
with  the  variance  obtained  from  the  fifty  samples.  Three  distributions 
are  tried.  For  the  classical  estimate 


there  is  no  agreement  even  with  n  ■  2000.  For  Anderssen  and  Jakeman' s 
estimate,  there  is  reasonable  agreement  for  two  of  the  three  distributions 
but  only  at  n  «  2000. 

In  conclusion,  stereologists  so  far  have  not  been  able  to  find  a 
usable  and  satisfactory  approximation  to  the  distribution  of  a  stereo- 
logical  estimate.  As  a  result,  they  cannot  do  any  inference. 

3.7b.  Introduction. 

In  an  intuitive  and  non  rigorous  manner,  we  will  indicate  how  the 
bootstrap  can  be  profitably  used.  Also,  we  will  indicate  the  wrong  way  and 
the  right  way  to  bootstrap. 
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Recall  that 


YrY2 . Yq  i.i.d.  *  PQ  -  j  F0  dQ(6)  . 

Since  E(— =•)  »  the  variance  Is  Infinite  so  that  the  usual  central  limit 
Y^ 

theorem  does  not  hold.  Watson  (1971)  shows  that 
1)  If  |  dG(9)  <  ® 

/sF  i  jL  rt  - 6>  *  ?  dG(e» 

where  Q  and  G  are  related  by  Q(6)  *  G(0|+),  (Section  1.6)  and 


B 


TT 

2H 


and 


y 


■r 


0  dG(0)  . 


11)  If 


rj 


d G(0)  **  00 ,  then 


jjjj— (£  |  ~  -  3)  ^  N(0,  *|^)  . 

log  n  n  ^  Ya  8y 


Usually,  when  we  bootstrap,  we  resample  YJ,Y*, ...,Y*  from  Fq, 
the  sample  c.d.f .  In  fact,  in  many  cases,  we  can  resample  from  any 
consistent  estimate.  However,  In  our  case,  as  we  shall  see  shortly,  we 
have  to  be  more  careful  regarding  which  estimate  of  F  to  use. 


Resampling  from  the  sample  c.d.f.  Fft  does  not  work.  Intuitively, 

the  reason  is  the  following:  If  yJ,Y2»«..»Y*  l.i.d.  ^  Fq  then 

E(— -)  <  00  .  So  the  usual  central  limit  theorem  holds.  The  normalizing 

Y*2^  j — — — 

constant  would  be  i/n  instead  of  A -  or  ■= -  .  Thus  we  would  not 

/  log  n  log  n 

expect  the  bootstrap  to  work.  This  is  indeed  supported  by  simulation 
results  which  we  will  discuss  later. 

3.7d.  The  right  way  to  bootstrap. 

00  J 

Assume  /  g-  dG(0)  <  <*>,  we  resample  from  e  consistent  estimate 

A 

F  £  IF„}  (this  is  the  case  for  the  minimum  distance  method) . 
n  Q 

Let  F  “  Fa  . 
n  Qn 

A  oo  ^ 

Since  is  typically  discrete  with  finite  support,  -g-  dGn(0)  <  °°. 

Thus  if  Y*,Y*,...,Y*  i.i.d.  'V-Fa  ,  then  for  each  fixed  n,  as  m->.  » 

m  Qn 

/dh  i  fj  -  E<^,)  * N<0’  w  £  £  d6»(f>» 

where 


y*  -  [  0  dG  (0) . 

'  0  n 

Hence  it  is  plausible  that  bootstrapping  in  this  manner  will  work.  Simula 
tlon  results  seem  to  support  this.  The  above  argument  is  not  rigorous; 
the  missing  step  is  some  kind  of  uniformity  argument  which  enables  us  to 
go  from  (fixed  n,  m  «°)  to  (m(n)  ■  n,  n  -*■  “) .  This  step  is  often  the 
essential  but  hardest  step  in  proving  results  concerning  the  bootstrap. 


But  what  makes  the  bootstrap  attractive  despite  the  limited  theoretical 
results  is  the  large  number  of  examples  where  the  bootstrap  seems 
to  work,  including  complicated  situations  where  there  is  no  standard  solu¬ 
tion.  Besides  suggesting  another  application  of  the  bootstrap,  our  present 
example  illustrates  another  point,  namely,  we  have  to  be  careful  in  boot¬ 
strapping,  contrary  to  what  it  appears  to  be,  "Bootstrap"  is  not  a  cookbook 
method. 

00 

Note:  We  assume  /_  —  dG(8)  <  00  which  admittedly  is  a  limitation.  However 

u  o 

when  there  is  truncation  (only  y  >_  y^  is  observed)  this  condition  is 
automatically  satisfied.  For  details,  see  section  3.8. 


3.7e.  Simulation  results. 

Case  1.  G(6|+)  puts  mass  0.2  at  0  ■  1,2, 3, 4, 5.  Sample  size  n  ■  100. 


Let 


i  “ 

*1  -  ¥  X 


1  n 


7^7 ' «7> 

n,  i 


For  both  and  T2,  we  bootstrap  the  following  quantities:  variance 

and  the  five  quantiles  Q>10»  Q>25»  ^.50’  ^.75  and  ^.90' 

i)  The  "true"  values  which  are  obtained  by  generating  1,000  independent 
samples  of  size  100  are  as  follows: 

For 

var  ■  0.0133 

Q  10  -  -0.1146  Q  25  -  -0.0730  Q  50  -  -0.0260 

Q>75  “  0.0352  Q  9Q  -  0.1067 
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For 


0.0042 


var 

Q>10  -  -0.144S  Q.25  -  -0.1072 

Q  ?5  -  -0.0281  Q  90  -  0.0217 


Q  5Q  -  -0.0718 


11)  Watson's  asymptotic  theory.  If 


then 


^  dG(0>  <  “ 


N(0, 


log  n  1 

/dh  t2  * N(0- 


|  dG(0)) 
^dG(0))  . 


In  our  case 


f  dG(6) 


0.6410 


V  -  2.1898  . 


From  these,  an  estimate  of  var  as  well  as  var  T2  Is  0.00674. 

Estimates  of  quantiles  of  both  T^,  T2  are  -0.1051,  -0.0554,  0,  0.0554, 
0.1051.  Thus  the  normal  approximation  Is  unsatisfactory,  especially  T2  . 

Ill)  Resample  from  the  sample  c.d.f.  (the  wrong  way  to  bootstrap).  The 
results  are  summarized  In  Tables  11,  12.  We  can  see  that  the  bootstrap  esti¬ 
mates  vary  greatly  from  trial  to  trial  and  they  do  not  seem  to  center  around 
the  true  value.  Thus  simulation  Is  In  agreement  with  the  conclusion  of 
section  3.7c;  resampling  from  the  sample  c.d.f.  does  not  work, 
lv)  Resample  from  MLE  (the  right  way) . 
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Not*:  It  may  be  more  fitting  to  choose  minimum  distance  estimates  instead 
of  the  MLE.  However,  the  conjecture  of  section  3.7d  is  that  we  can  re¬ 
sample  from  a  consistent  estimate  £  satisfying  F  £  {F^} .  And  in 

n  n  Q 

our  present  case  of  finite  mixture  with  known  support,  the  MLE  is  con¬ 
sistent.  So  the  MLE  will  serve  equally  well  the  purpose  of  verifying 
the  conjecture  numerically.  The  results  are  summarized  in  tables  13 
and  . 

The  results  are  not  spectacular,  but  are  much  better  than  when  we 
resample  from  the  sample  c.d.f.  The  estimates  are  much  more  stable,  they 
do  not  vary  that  much  from  trial  to  trial.  As  to  whether  they  center  around 
the  true  values,  the  bootstrap  is  quite  successful  for  T^,  while  it  seems 
to  fail  for  var^)  but  this  is  to  be  expected  since  T.  has  infinite 
variance  even  though  it  has  finite  asymptotic  variance. 

v)  We  attempt  to  find  out  whether  the  failure  of  resampling  from 
the  sample  c.d.f.  Fr  is  due  to  (i)  Fq  t  (Fq),  the  proposed  explana- 

tlon,  or  just  (ii)  E(— x)  -  —  7  — —  has  infinite  variance. 

Y  n  i-1  Yi 

We  throw  away  y  .  and  resample  from  the  resulting  sample  c.d.f., 

Ilj  1 

then  E(— )  -  — \  - -  has  finite  variance. 

1  n— 1  Y  . 

i*2  n,i 

The  results  are  in  tables  15  and  16,  even  though  there  are  only  two 
trials,  it  is  quite  evident  that  the  bootstrap  does  not  work  well  in 
this  case.  Thus  it  appears  the  problem  is  due  to  Ffl  t  (f^} . 


TABLE  11 


Bootstrap  estimates  of  the  variance  and  the  0.1,  0.25,  0.5,  0.75,  0.9 
quantiles  of  Tj^  -  —  ■—  -B,  10  trials,  sample  size  -  100. 

G(6|+)  puts  mass  0.2  at  6  -  1,2, 3, 4, 5  with  resampling  from  the  sample 
c .  d  •  f . 

TRUE  VALUES  (based  on  1000  independent  samples) 


Var 

Q0.1 

Q0.25 

Q0.50 

Q0.75 

Q0.90 

0.0133 

-0.1146 

-0.0730 

-0.0260 

0.0352 

0.1067 

BOOTSTRAP  ESTIMATES  (resample  1000 

times  from  the  sample  c.d 

0.0042 

-0.0788 

-0.0435 

-0.0015 

0.0422 

0.0857 

0.0130 

-0.1386 

-0.0791 

-0.0083 

0.0740 

0.1512 

0.0027 

-0.0659 

-0.0347 

-0.0017 

0.0339 

0.0667 

0.0179 

-0.1563 

-0.0925 

-0.0102 

0.0868 

0.1811 

0.0025 

-0.0616 

-0.0326 

-0.0010 

0.0335 

0.0665 

0.0021 

-0.0577 

-0.0230 

-0.0022 

0.0310 

0.0608 

0.0064 

-0.0945 

-0.0552 

-0.0053 

0.0506 

0.1062 

0.0196 

-0.1591 

-0.1152 

-0.0130 

0.0849 

0.1883 

0.0016 

-0.0501 

-0.0271 

-0.0016 

0.0258 

0.0513 

0.0022 

-0.0599 

-0.0300 

0.0005 

0.0331 

0.0633 

TABLE  12 

Bootstrap  estimates  of  the  variance  and  the  0.1,  0.25,  0.5,  0.75,  0.9 
quantiles  of  Tj  ■  ^i-2  - 10  trials,  sample  size  100. 

i  n»  1 

G(0|+)  puts  mass  0.2  at  9  ■  1,2, 3, 4, 5  with  resampling  from  the  sample 
c.d.f . 


TRUE  VALUES  (based  on  1000  independent  samples) 


Var 

Q0.1 

Q0.25 

Q0.50 

Q0.75 

Q0.90 

0.0042 

-0.1445 

-0.1072 

-0.0718 

-0.0281 

0.0217 

BOOTSTRAP  ESTIMATES  (resample  1000  times  from  the  sample  c.d, 

0.0037 

-0.1068 

-0.0744 

-0.0366 

0.0071 

0.0478 

0.0117 

-0.1930 

-0.1389 

-0.0756 

0.0040 

0.0809 

0.0024 

-0.0875 

-0.0587 

-0.0276 

0.0074 

0.0372 

0.0148 

-0.2226 

-0.1670 

-0.0978 

-0.0087 

0.0849 

0.0023 

-0.0838 

-0.0551 

-0.0234 

0.0106 

0.0421 

0.0020 

-0.0774 

-0.0519 

-0.0247 

0.0086 

0.0374 

0.0051 

-0.1317 

-0.0970 

-0.0554 

-0.0081 

0.0444 

0.0114 

-0.1888 

-0.1609 

-0.1231 

-0.0472 

0.0552 

0.0014 

-0.0665 

-0.0465 

-0.0223 

0.0032 

0.0276 

0.0022 

-0.0797 

-0.0492 

-0.0191 

0.0139 

0.0434 
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TABLE  13 


Bootstrap  estimates  of  the  variance  and  the  0.1,  0.25,  0.5,  0.75,  0.9 
quantiles  of  T^  •  ^  “  fl»  10  trials,  sample  size  •  100,  G(0|+) 
puts  mass  0.2  at  6  -  1,2, 3, 4, 5  with  resampling  from  the  MLE. 


TRUE  VALUES  (based  on  1000  Independent  samples) 


Var 

*0.1 

Q0.25 

Q0.5 

Q0.75 

Q0.90 

0.0133 

-0.1146 

-0.0730 

-0.0260 

0.0352 

0.1067 

BOOTSTRAP  ESTIMATES  (resample  1000  times  from  the  MLE) 

0.0228 

-0.1081 

-0.0722 

-0.0210 

0.0523 

0.1283 

0.0225 

-0.1057 

-0.0687 

-0.0216 

0.0511 

0.1261 

0.0197 

-0.0940 

-0.0616 

-0.0172 

0.0464 

0.1090 

0.0235 

-0.1147 

-0.0749 

-0.0189 

0.0552 

0.1276 

0.0217 

-0.1049 

-0.0652 

-0.0163 

0.0409 

0.1248 

0.0173 

-0.0814 

-0.0560 

-0.0137 

0.0406 

0.0981 

0.0202 

-0.1000 

-0.0643 

-0.0189 

0.0481 

0.1131 

0.0198 

-0.0979 

-0.0610 

-0.0161 

0.0447 

0.1122 

0.0180 

-0.0891 

-0.0597 

-0.0166 

0.0420 

0.1070 

0.0230 

-0.1072 

-0.0709 

-0.0200 

0.0525 

0.1317 

AVERAGE  OVER  10  TRIALS 


TABLE  14 


Bootstrap  estimates  of  the  variance  and  the  0.1,  0.25,  0.5,  0.75,  0.9 

quantiles  of  T,  “  — £  n,  - - 0,  10  trials,  sample  size  ■  100, 

n-^  n,i 

G(0 | +)  puts  mass  0.2  at  0  ■  1,2, 3, 4, 5  with  resampling  from  the  MLE. 


TRUE  VALUES  (based  on  1000  independent  samples) 


Var 

%.l 

Q0.25 

Q 

0.50 

Q0.75 

Q0.90 

0.0042 

-0.1445 

-0.1072 

-0.0718 

-0.0281 

0.0217 

BOOTSTRAP  ESTIMATES  (resample  1000  times  from  the  MLE) 

0.0050 

-0.1428 

-0.1114 

-0.0695 

-0.0180 

0.0273 

0.0048 

-0.1374 

-0.1080 

-0.0688 

-0.0195 

0.0280 

0.0037 

-0.1265 

-0.0967 

-0.0615 

-0.0182 

0.0269 

0.0052 

-0.1501 

-0.1151 

-0.0729 

-0.0205 

0.0284 

0.0044 

-0.1398 

-0.1071 

-0.0669 

-0.0212 

0.0252 

0.0028 

-0.1110 

-0.0858 

-0.0555 

-0.0159 

0.0191 

0.0040 

-0.1315 

-0.1018 

-0.0634 

-0.0202 

0.0239 

0.0037 

-0.1284 

-0.0993 

-0,0624 

-0.0201 

0.0250 

0.0031 

-0.1162 

-0.0930 

-0.0588 

-0.0179 

0.0219 

0.0049 

-0.1435 

-0.1116 

-0.0709 

-0.0196 

0.0255 

AVERAGE  OVER 

10  TRIALS 

0.0042 

-0.1327 

-0.1027 

-0.0650 

-0.0191 

0.0251 

TABLE  IS 


Bootstrap  estimates  of  the  variance  and  the  0.1,  0.25,  0.5,  0.75,  0.9 
quantiles  of  ^  E^^  -  0,  two  trials,  sample  size  •  100,  G(8|+) 

puts  mass  0.2  at  9  -  1,2, 3, A, 5  with  resampling  from  the  sample  c.d.f .  when 
Y  .  Is  thrown  away  from  the  sample. 


TRUE  VALUES  (based  on  1000  Independent  samples) 

T“  Vl  V25  V5  V75  V» 

0.0133  -0.1146  -0.0730  -0.0260  0.0352  0.1067 

BOOTSTRAP  ESTIMATES  (resample  1000  times  from  the  sample  c.d.f.  when  Y 
is  thrown  away  from  the  sample) . 


0.0026  -0.1114  -0.0821  -0.0477  -0.0128  0.0167 

0.0079  -0.1893  -0.1406  -0.0859  -0.0238  0.0358 


TABLE  16 

Bootstrap  estimates  of  the  variance  and  the  0.1,  0.25,  0.75,  0.9 
quantiles  of  T2  -  E^  - 8,  two  trials,  sample  size  -  100,  G(9|t) 

”  n.i 

puts  mass  0.2  at  “  -  1,2, 3, 4, 5  with  resampling  from  the  sample  c.d.f.  when 

Y  .  is  thrown  away  from  the  sample. 
n,x 

Var  Q0 #1  Q0.25  Q0.5  Q0.75  Q0.9 

0.0042  -0.1445  -0.1072  -0.0718  -0.0281  0.0217 

BOOTSTRAP  ESTIMATES  (resample  1000  times  from  the  sample  c.d.f.  when 

Y  !  Is  thrown  away  from  the  sample) . 

u9X 


0.0025  -0.1336  -0.1006  -0.0676  -0.0332  -0.0029 

0.0067  -0.2298  -0.1875  -0.1368  -0.0810  -0.0233 
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Case_2.  g(6)  “  — j  exp (-9  /2c  ),  9  >  0.  The  stationary  distribution 

c 

c  ■  4,  n  ■  100. 

This  case  is  included  since  in  the  preceding  case,  as  in  Case  1 

of  section  3.6,  we  assume  we  know  the  points  of  support.  We  bootstrap 

the  sane  six  quantities  but  only  for  T  -  Z  “  ^ - 8.  Again, 

n"  “  n,i 

the  "true"  values  are  obtained  by  generating  1000  independent  samples 
of  size  100. 

i)  Resample  from  the  sample  c.d.f.  The  results  sunmarlzed  in  Table  17 
are  unsatisfactory. 


TABLE  17 


Bootstrap  estimates  of  the  variance  and  the  0.1,  0.25,  0.5,  0.75,  0.9 

1 


-  8,  10  trials,  sample  size  •  100, 


"U“tll“  °f  *-  &  ri“2  Y  ( 

0  —0^/32  Hgl 

g(9)  *  «  with  resampling  from  the  sample  c.d.f. 


TRUE  VALUES  (based  on  1000  Independent  samples) . 


Var 


<0.1 


<0.25 


<0.5 


<0.75 


<0.9 


0.000940  -0.062868  -0.049041  -0.033234  -0.014582  0.008415 


B00STRAP  ESTIMATES  (resample  1000  times  from  the  sample  c.d.f.) 


0.003491 

0.002176 

0.000208 

0.000238 

0.005164 

0.000537 

0.013839 

0.000500 

0.001743 

0.000485 


-0.102954 

-0.082955 

-0.023839 

-0.025432 

-0.122902 

-0.041454 

-0.210793 

-0.040470 

-0.078458 

-0.039962 


-0.086164 

-0.061658 

-0.016539 

-0.017687 

-0.102485 

-0.028469 

-0.183493 

-0.028547 

-0.061143 

-0.029277 


-0.057824 

-0.032484 

-0.007060 

-0.007264 

-0.072150 

-0.013718 

-0.113879 

-0.013560 

-0.035821 

-0.016017 


-0.012124 

0.000036 

0.003126 

0.003130 

-0.017414 

0.002967 

-0.015015 

0.001731 

-0.006285 

-0.003058 


0.042565 

0.035518 

0.012128 

0.012771 

0.045898 

0.017921 

0.075649 

0.017817 

0.028975 

0.013771 


it)  Resample  from  F-  where  6  Is  some  consistent  estimate  of  Q. 

Qn 

1)  Qq  B  Deely  and  Kruse's  estimate.  The  results  summarized  In 
Table  18  are  still  unsatisfactory. 

We  think  the  failure  is  due  more  to  insufficient  sample  size  rather 
than  the  theory  itself.  To  illustrate  this,  we  consider  the  following 
two  estimates  which  should  be  progressively  better. 

2)  is  Deely  and  Kruse's  estimate  except  that  we  minimize  the 

distance  from  F*  instead  of  F  where  F*  is  the  true  F.  known 

Q  n  Q  Q 

only  up  to  its  values  at  *  *  *  *^n‘ 

The  results  are  in  Table  19. 

3)  Gn  ■  Fn  the  sample  c.d.f.  0^(0  ■  < •  |  + )  . 

Since  we  are  sampling  from  the  stationary  distribution,  G  **  F^ 
so  that  Fr  which  estimates  Fq  consistently  also  estimates  G 
consistently. 

The  results  given  in  Table  20  are  quite  satisfactory. 
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TABLE  18 


Bootstrap  estimates  of  the  variance  and  the  0.1,  0.25,  0.5,  0.75,  0.9 
quantiles  of  T  ■  — ^  Z.n,  - 8,  10  trials,  sample  size  -  100, 

9  n-i  x“z  J 

a  —0/32 

g(9)  “  —  e  '  with  resampling  from  Deely  and  Kruse's  estimate. 

Id 

TRUE  VALUES  (based  on  1000  Independent  samples) 


Var  Q0.1  Q0.25  Q0.5  Q0.75  Q0.9 

0.000940  -0.062868  -0.049041  -0.033234  -0.014582  0.008415 

BOOTSTRAP  ESTIMATES  (resample  1000  times  from  Kruse  and  Deely's  estimate) 


0.001708 

-0.085659 

-0.072052 

-0.052141 

-0.026198 

0.003914 

0.001989 

-0.090900 

-0.073839 

-0.050470 

-0.018998 

0.015578 

0.000434 

-0.043723 

-0.034247 

-0.022845 

-0.007393 

0.007296 

0.000565 

-0.049936 

-0.038930 

-0.024875 

-0.007648 

0.008182 

0.001926 

-0.089928 

-0.075226 

-0.055922 

-0.029547 

0.000282 

0.000938 

-0.063185 

-0.050022 

-0.032467 

-0.010730 

0.013962 

0.005940 

-0.162466 

-0.143522 

-0.114927 

-0.069700 

0.007194 

0.000785 

-0.056717 

-0.045806 

-0.028815 

-0.010053 

0.013531 

0.001100 

-0.066572 

-0.053197 

-0.034692 

-0.013973 

0.012597 

0.000504 

-0.047352 

-0.036867 

-0.023304 

-0.007649 

0.007293 

TABLE  19 


Bootstrap  estimates  of  the  variance  and  the  0.1,  0.25,  0.5,  0.75,  0.9 


quantiles  of  f  * 

JL  e-e2/ 32 
8(0)"l6  e 


n^L  ^1-2  Y~ ^ - B,  10  trials,  sample  size  **  100, 

n,i 

with  resampling  from  Deely  and  Kruse’s  estimate  when 


H  »  Fq  is  known  up  to  its  values  at  y^,...,yn< 
TRUE  VALUES  (based  on  1000  Independent  samples) 


Var 

<0.1 

Q0.25 

Q0.5 

Q0.75 

Q0.9 

0.000940 

-0.062868 

-0.049041 

-0.033234 

-0.014582 

0.008415 

BOOTSTRAP  ESTIMATES  (resample  1000  times  from  Deely  and  Kruse's  estimate 
when  H  ■  Fp  is  known  up  to  Its  values  at  y^, ... , y^) . 

0.000872 

-0.062096 

-0.049394 

-0.031613 

-0.012375 

0.011283 

0.000686 

-0.054561 

-0.042848 

-0.026699 

-0.008555 

0.011001 

0.000574 

-0.049875 

-0.039274 

-0.025585 

-0.007829 

0.009260 

0.000581 

-0.049992 

-0.039118 

-0.024924 

-0.007506 

0.009355 

0.000613 

-0.051867 

-0.040340 

-0.025498 

-0.007999 

0.008047 

0.000907 

-0.060692 

-0.049103 

-0.031294 

-0.010354 

0.013510 

0.000641 

-0.052962 

-0.041608 

-0.026311 

-0.008401 

0.010680 

0.000629 

-0.052042 

-0.041308 

-0.025856 

-0.008568 

0.010410 

0.000629 

-0.052456 

-0.041750 

-0.026272 

-0.008925 

0.009852 

0.000801 

-0.057060 

-0.046070 

-0.029845 

-0.010247 

0.011531 
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TABLE  20 


Bootstrap  estimates  of  the  variance  and  the  0.1,  0.25,  0.5,  0.75,  0.9 

quantiles  of  T  -  ^  Y - P*  10  trials,  sample  size  -  100, 

•”  ^32  with  resampling  from  F„  where  Q  -  G(*|+)  and  G 
°  16  Q 

Is  the  sample  c.d.f. 


TRUE  VALUES  (based  on  1000  Independent  samples) 


Var  %.l  <>0.25  Q0.5  Q0.75 

0.000940  -0.062868  -0.049041  -0.033234  -0.014582 


*0.9 

0.008415 


BOOTSTRAP  ESTIMATES  (resample  1000  times  from  F*  where  Q 

Q 

and  G  Is  the  sample  c.d.f.) 


cM+> 


0.000835 

0.000969 

0.000618 

0.000820 

0.001072 

0.000655 

0.000873 

0.000875 

0.000860 

0.000721 


-0.063257 

-0.067132 

-0.052631 

-0.061422 

-0.069756 

-0.055363 

-0.064044 

-0.060043 

-0.065346 

-0.056667 


-0.050242 

-0.054329 

-0.040382 

-0.047123 

-0.055724 

-0.043477 

-0.051530 

-0.046270 

-0.049600 

-0.043761 


-0.033450 

-0.034629 

-0.026392 

-0.028338 

-0.036793 

-0.028286 

-0.035942 

-0.029765 

-0.033415 

-0.026372 


-0.015003 

-0.012333 

-0.008904 

-0.009079 

-0.015227 

-0.010781 

-0.016808 

-0.009939 

-0.014309 

-0.009678 


0.008661 

0.009553 

0.010853 

0.009107 

0.008884 

0.008964 

0.005584 

0.013792 

0.007078 

0.008854 


3.7f.  Conclusions  and  applications. 


The  simulation  results  are  quite  encouraging.  It  appears  that  the 
bootstrap  methodology  can  be  a  contribution  to  the  stereological  literature. 
This  opens  up  a  lot  of  new  possibilities,  in  particular,  we  can  do  approximate 
inference.  For  example,  an  approximate  confidence  interval  of  E(y)  can  be 
derived  from  P(Qq  <  T2  <  Q1-fl()  -  l-2o  where  we  recall 

n 


IB* 

n^L  A 


T~  -  '<?>  • 


i-2  n,i 

A  naive  way  is  to  replace  and  *he  quantiles  of  T^  by  their 

bootstrap  estimates. 
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3.8.  Truncation. 


If  we  observe  only  y  yg,  following  the  same  argument  as  In  section  2.5 

/2  ? 

y  -y^  instead 

of  T  and  the  distribution  of  /?  -y^  given  0  ^  yg  Instead  of  G(0) . 

Suppose  G(3)  has  density  g(0)  then 


«ce|e  >  y0> 


sSSL 


r 

*  w 


g(0)d6 


g(6)  . 


Let 


So 


c 


«2  -  02  -  A 


2?dC  -  20d0 


d0  K 
d5  "  7 


/r2  2 

^  +yn 


gg(£le  >  y0) 


.  _L 


g0(^2+yj  |0  >  y0) 


One  consequence  Is  that 


CHAPTER  IV 


THIN  SLICE  OF  THICKNESS  2T 


4.1.  The  basic  formulas. 


(4.1.1) 


-  H(yj+)  -  I 

'  V 


JeW 


+  T 


e  +  T 


dG(0|+) 


(4.1.2) 


dO(8|t)  - 

Me 


where  y _  -  / 9  dG(9) .  So 


(4.1.3) 


-  H(y|+)  -  [ 

J  V 


3L-±I 


UG  +  T 


dG(0) 


4.2.  Decomposition  and  lnveraion. 

We  shall  see  In  this  section  that  the  three  cases  a)  G  continuous, 
b)  G  mixed,  c)  G  discrete  have  to  be  treated  separately.  This  fact  seems 
to  be  unnoticed;  all  the  formulas  derived  thus  far  In  the  literature  are 
valid  only  in  the  continuous  case.  The  results  of  4.2b,  4.2c,  and  4. 2d 
are  entirely  new  to  the  best  of  our  knowledge.  Section  4.2a  contains 
no  new  results.  They  can  be  found  in  Coleman  (1979) 

4.2a.  If  G  is  continuous,  then  so  is  H(y|+)  and  we  can  differentiate  (4.1.3) 
to  get 

<4.2.1)  h(y|t)  -2-SM+  f  — 3L_  S»ije 

V*  ]y  ^7  V* 


I 


which  can  be  inverted  to  give 


<«•*•*>  ^  i  j!  h(y|+)dy 


where 


f(o>)  ■  /2tt  e  {l 


--f 

Jr*  I 


»  -iu2 

e  du)  . 


Integrating  (4.2.2) 


>•2.3)  1  -  G(0)  -  J\  f  (-JL^t~9-  }  h(y|+)dy 


Setting  0  »  0  yields  pG  as  a  function  of  H(*|+)» 


(4.2.4) 


•A  -V  f0  h(yi+,d5' 


4.2b.  A  basic  fact. 

H(y|6,i)  has  a  discrete  component  at  y  ■  0,  this  can  be 
explained  physically,  since  the  observed  y  is  the  profile  (maximum) 
radius .  If  we  let  t  be  the  perpendicular  distance  of  the  center  of 
the  sphere  from  the  center  of  the  slice,  then  for  0  £  t  <_  t  ,  the 
observed  y  is  0,  the  radius  of  the  sphere.  Since  H(y|3,t)  has 
a  discrete  component  at  y  ■  0,  so  if  G  put  mass  r.t  0  ■ 
then  H(yjt)  ■  /  H(y|0,+)  dG(0[+)  would  also  put  mass  at  the  same  points 

0  i , .  • . ,  0_  • 

x  m 
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Thus,  if  G  is  not  continuous,  we  cannot  differentiate  (4.1.3)  to  get 

(4.2.1).  We  can  see  that  the  single  equation  (4.2.1)  cannot  be  right  since 

it  would  treat  probability  density  and  probability  (the  mass  put  on 

6. ,...,6  )  on  an  equal  footing, 
l  m 

The  correct  formulas  are  the  following:  Let  G  -  where 

G^  -  continuous  part  of  G 
m 

Gd  ■  £  ®d^®i^0  ~  discrete  part  of  G  ,  m  can  be  00  . 


4.2c.  Assume  G  is  mixed,  i.e.  /  dG£  >  0,  /  dG^  >  0.  We  will  treat  the 
purely  discrete  case  later.  Then  H(*|+)  ■»  Hc  +  and 


(4.2.5) 


i  liG« 


Note:  This  implies  and  G^  have  the  same  jump  points. 


(4.2.6)  hc(y) 


T  «c(y)  ,«  y  8C<0) 

yr+T  +  ^  yr+T 

G  jy  /02-y2  G 


— ■*—  ~rdG.(0) 


(4.2.6)  can  be  rewritten  as 


r  y  1  T  8c(y)  r  v  8c(8) 

(4.2.7)  h  (y)  -  - 2—  i  dK,(0) - -  +  - -2—  d0  . 

^7  T  "  V"  iy  Jj—2  VT 


(4.2.7)  has  the  same  form  as  (4.2.1)  with  h  in  (4.2.1)  replaced  by  the  left 
hand  side  of  (4.2.7),  g  in  (4.2.1)  is  replaced  by  gc.  Thus  (4.2.7)  can  be 
inverted  to 

(4.2.8)  gcW  --/f  ^  df  Cf  thc(y)  -  [  -±=  idHd(5)).dy). 


y  kW 
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1 


Integrating  (4.2.8)  gives 


(4.2.9)  {  40c-Cc(9)  ^  f  f  ~f  7^=7  «d«»  •  <* 


Inverting (4.2. 5)  gives 


I 


(4.2.10) 


Vg+T 


Gd(9)  -  Hd(8) 


So,  except  for  y_  and  /  dG  ,  (4.2.9)  and  (4.2.10)  give  G  and  G.  in 
be  .CO 

terms  of  Hc  and  H^.  Set  0  ■  0  in  (4.2.9),  we  then  have 


(4.2.11)}  dGc  ^  Ihc(y)  -  j  -^fdHd(C)]dy 


r  T  1 


Set  0  -  00  in  (4.2.10)  and  using  the  fact  /  dG  +  /  JG  ■  1  we  have 

c  d 


(4.2.12) 


-  }  dGc  ■  ^  }  dHd  • 


(4.2.11)  and  (4.2.12)  are  two  equations  in  the  two  unknowns  /  dGc  and  y^, 
thus  they  can  be  solved  to  give  /  dGc  and  y^  in  terms  of  H£  and  H^. 


4. 2d.  G  is  discrete. 


G  -  G.  -  l  8i  6@ 
i-1  i 


(4.2.5)  and  (4.2.6)  become 
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(4.2.13) 


H _ L-  G 

d  VT 


(4.2.14) 


Vy)  “  f  jr+t dG(6) 
Jy  AW  v° 


(4.2.13) 


d  VT 


Since 


/  ■„  +  '  Hc  -  1  ,  /  dHc  -  — T 

V 


Normalizing  (4.2.13)  and  (4.2.14)  we  get 


(4.2.15) 


_  hc<y) 

"  /dHc 


(4.2.16) 


-  f  -=*=  ^ 

1 y  c 


Observation  4.2.1:  (4.2.16)  la  just  (3.1.3)  with  h  replaced  by  h£t 
so  as  far  as  Is  concerned,  we  are  back  to  the  random  plane  case. 

A  physical  explanation  of  observation  4.2.1:  Let  A  denotes  a  thin 
slice  of  thickness  2t,  and  the  two  plane  sections  at  the  two  ends 
of  A  as  shown  in  the  diagram. 
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Then  hc(y)  is  just  the  conditional  density  of  y  given  that  a  sphere  is 
hit  by  the  thin  slice  but  the  center  of  the  sphere  lies  outside  the  interior 
of  A  (that  is,  either  hits  the  right  half  of  the  sphere  or  PR  hits 

the  left  half  of  the  sphere).  Thus  h£(y)  is  just  the  conditional  density 
of  y  given  that  a  sphere  is  hit  by  e>  random  plane. 

Note:  For  the  mixed  and  continuous  case,  the  above  argument  falls.  We 
can  no  longer  say  that  hc(y)  is  the  conditional  density  of  y  given  that 
the  left  half  of  a  sphere  hits  PR  or  the  right  half  of  the  sphere  hits 
PL.  Because  of  the  continuity  of  Gc,  the  discrete  part  of  n("|6t+) 
(corresponds  to  when  the  center  of  the  sphere  lies  in  the  interior  of  the 
thin  slice  A)  can  contribute  to  H  ,  the  continuous  part  of  H(* | +) . 

T  sc<y> 

This  contribution  is  represented  by  the  term  ■  *«■  —  in  equation  4.2.6. 

vT 

By  observation  4.2.1,  the  inversion  of  (4.2.16)  has  already  been 
done  in  section  3.1.  Corresponding  to  (3.1.4),  we  have 


(4.2.17) 
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3.  Decomposition  of  data,  Hn  and  F( 


a** 


In  this  section,  we  are  going  to  Introduce  various  decompositions 
that  correspond  to  H  -  H^  +  H^. 

4.3a.  Notations  concerning  the  data. 

Given  the  data  y^»y2» • « •  »yQ»  define  £,(n)  ■  #  of  distinct  y's. 
Denote  the  distinct  y's  by  xi»x2» •  •  •  »xf,(n)  with  corresponding 
empirical  frequencies  ^1*^2*  *  *  *  *  ^et 

G  -  Gc  +  Gd 
m 


Gd  “  8i68l  »  0  “  *81,62 . 9m*  * 

Define  n  -  empirical  frequency  of  6.  in  the  data  y, ,y_,...,y  , 
1  1  I  2  n 


1 


1, 


•  •  •  ,m. 


4.3b.  Decomposition  of  data. 


We  are  going  to  decompose  the  data  according  to  whether  f ^  >  1 


or  not. 


i.  "Discrete"  data  (corresponds  to  f^  >  1).  Let 


"d  -  fi  1{fi>  1} 

"<n>  ■  X  i{f1>  1}  • 


That  is,  n^  is  the  number  of  observations  in  the  "discrete"  data  and 
m(n)  is  the  number  of  "distinct?'  values.  Denote  the  distinct  values  by 

^l’^2*  *  ’  **^m(n)  *  Deflne  4n  “  {*i»*2 . ^(n)** 


ii.  "Continuous"  data  (corresponds  to  f .  ■  1) .  n  ■  n-n ,  is 

i  c  d 

the  number  of  observations  in  the  "continuous"  data.  Denote  them  by 

*l*z2*  *  *  *  **n  * 
c 

Roughly  speaking,  we  are  regarding  the  "discrete"  data  as  a  sample 
from  Hj  and  the  "continuous"  data  as  a  sample  from  H^. 

Note:  When  there  is  measurement  error,  exact  ties  may  not  be  observed. 

We  then  need  to  consider  all  the  values  that  cluster  together  as  tied 
values.  This  would  Increase  the  probability  of  mlsdasslf ication  of  data. 
Of  course,  the  criteria  of  identifying  a  cluster  depends  on  our  knowledge 
of  the  size  of  the  measurement  error. 

4.3c.  Decomposition  of  H0. 

Let 

1  n 

Vy)  "  n  ^ 1  {y  <  y} 

be  the  sample  c.d.f. 

Define 

1  *<?> 

H  «(y)  •  “  y  f  .  I  r  ,  1  '  If,  .1  • 

nd  f  n  ^  i  lxi  <_  y)  lfi>  1) 

For  continuous  distribution,  the  probability  of  observing  the  same  number 
twice  is  zero,  so 

(4.3.1)  $nC0  with  probability  1  . 

Therefore,  an  equivalent  definition  is 

Hnd(y)  ‘ni[1niI{ei<y}I{ni>U 


1 

’  n  i-i  fi  1{xi  £  y} 


H  (y)  -  H  (y)  -  H  .(y) 
nc  7  n  7  nd  7 


1  \°> 

"n  ^f^-l} 

n^ 

" "  Ji  1{zi£y}  * 


S  ,  H-,  H  ,  H  ,  are  all  subdistribution  functions,  we  can  normalize  them 
c  a  nc  na 

to  distribution  functions  H  ,  H.,  H  ,  H  ..  That  is. 

c  d  nc  nd  * 


H  -  H  //dH 
c  c  c 


Hd  -v/dHd 


H  -  H  //dH 
nc  nc  nc 


-  H  /(n  /n) 
nc  c 


’  \  i-1  1{zi  -  y} 


Hnd(y>  -  Hnd"dHnd 


Hnd/(nd/n) 


"  ®a  ii  fiI<xi<y>  * 
i  ■ 

"  “I  >1  tti  ^ei  <  y>  *  IK>1} 
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« 


5 


I 


1 


k  ; 

"a 


4.3d. 


Decomposition  of  F 


2i* 


Recall  the  identification 


H(y|e,+)^pe(y) 
G(0|+)^Q(0) 
H(yl+)«-*-FQ(y)  . 

G  "  Gc  +  Gd  ’  Gd  “  l  8ifi 

i-1 

Q  -  Qc  +  Qd  * 


By  (4.1.2) 


and 


Q<0) 


(C-K)dG(C) 


V* 


Gd 


m  0  +r 

ii  *3*  8l\ 


■  L  ’tV 


i-1 


Equation  (4.2.5)  can  then  be  rewritten  as 


(4.3.2) 
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4.4.  Methods  based  on  inversion  formulas. 

4.4a.  G  continuous. 

Replace  h(y|+)dy  In  (4.2.3),  (4.2.4)  by  dHn(y) .  If  Hn  Is  the 

sain>le  c.d.f.,  we  are  using  classical  back  substitution;  if  H  is  the 

n 

sample  c.d.f.  smoothed  by  piecewise  Lagrange  interpolation,  we  are  using 
Anderssen  and  Jakeman's  approach. 

4.4b.  G  mixed. 

Replace  dH^  by  dH^  and  hc(y)dy  by  dHRc(y)  in  equations 

(4.2.9),  (4.2.10),  (4.2.11)  and  (4.2.12),  where  H  ,  and  Hqc  are  estimates 

of  H^,  Hc  respectively.  For  example,  we  can  let  where 

is  defined  in  section  4.3c  and  H _ is  a  smoothed  version  of  H  . 

nc  nc 

4.4c.  G  discrete. 

Here,  we  have  a  choice  since  we  have  two  Inversion  formulas,  namely 
(4.2.15)  and  (4.2.17). 

We  can  obtain  an  estimate  of  G  by  simply  replacing  Hj  by  an 
estimate.  in  (4.2.15). 

Alternatively,  we  can  replace  h  (y)dy  in  (4.2.17)  by  dHnc(y)  where 

H  is  some  estimate  of  H  . 
nc  c 

For  example,  we  can  use  to  estimate  and  a  smoothed  version 

of  H  to  estimate  H  where  H  and  H  ,  are  defined  in  section  4.3c. 
nc  c  nc  nd 
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A  closer  look  at  the  discrete  case. 


The  stage  is  aet  in  section  4. Ac.  We  see  from  there  that  we  can  use 
either  (4.2.15)  or  (4.2.17)  to  derive  an  estimate  of  G. 

4.5a.  Treating  part  of  thin  slice  data  as  planar  data. 

We  have  more  flexibility  than  is  indicated  in  section  4.4c.  (4.2.17) 

is  just  the  inversion  of  (4.2.16)  and  observation  4.2.1  gives  us  a  random 
plane  Interpretation  of  (4.2.16).  Thus  if  we  can  separate  from  the  data 
y, the  part  which  is  a  sample  from  H  ,  we  can  apply  to  that 
part  of  the  data  all  the  methods  we  have  discussed  in  chapter  III.  A 
reasonable  way  to  separate  the  data  is  to  regard  the  "continuous"  data 
defined  in  section  4.3b. as  a  sample  from  H  .  If  m  is  finite,  a  justi- 
flcatlon  for  doing  so  is 

v  5.1)  P(misclasslflcatlon  of  data)  -*-0  as  n.  •  . 

We  shall  prove  (4.5.1)  later.  Actually,  more  can  be  said  in  the  case  when 
m  is  finite. 

4.5b.  A  basic  question. 

Returning  to  the  point  we  made  earlier,  we  can  proceed  from  either 
(4.2.15)  or  (4.2.16)  (or  its  inversion  (4.2.17))  to  obtain  an  estimate  of  G. 
But  both  equations  contain  information,  thus  we  should  make  use  of  both  of 
them  instead  of  just  one.  Of  course,  we  can  always  take  a  simple  average 
or  weighted  average  of  two  estimates,  one  based  on  (4.2.15)  and  the  other 
on  (4.2.16).  But  are  there  other  ways,  ways  which  are  more  built  into  the 
procedure  for  extracting  the  Information  contained  in  both  equations?  We 


will  Me  iron  section  4.7  that  Che  minimum  distance  method  Is  one  such  way. 
In  the  remainder  of  this  section,  we  will  discuss  another  procedure  which 
uses  both  equations  and  which  has  its  own  appeal. 

4.5c.  Relationship  between  and  6  . 

We  shall  prove  the  following  results  for  the  case  when  G  is  mixed, 
thus  Including  the  discrete  case  as  a  special  case.  Let 

G  •  G_  +  G . 
c  d 


m 


l 

1-1 


8i  V 


9 


m  can  possibly  equal  infinity  . 


Equation  (4.2.5)  tells  us  that  G  and  H  have  the  same  points  of  jump, 
moreover  if  G  put  mass  g^  at  8^,  then  H  put  mass  h^  •  g^  at 

ni  *■ 

0^.  By  the  strong  law  of  large  numbers,  —  -*■  h^  with  probability  1  so 
that  for  almost  sample  sequences,  6^  will  be  included  in  the  "discrete" 
data  if  n  is  sufficiently  large.  In  other  words,  if  we  let 

00 

(4.5.2)  ♦  -  U  ♦  •  note  that  $  C  4  (see  section  4.3b) 

w  •  n  Q  UTI 


(4.5.3) 


P(0i  e  tw)  -  1  for  i  fixed. 


Since  there  can  only  be  countably  many  points  of  jump 


9 


that  Is, 


P(  !\  (0  e  <bj)  -  1 
i-1 

p(e  c  *„>  -  i 


(4.3*1)  •=>  9m  C.  Q  with  probability  1.  Therefore  P  (G  *  *  1.  Since 

we  can  never  observe  the  whole  sample  sequence,  it  would  be  nice  if  we 

can  say  something  about  P($n  “  0)  for  n  large. 

If  m  <  08 ,  (4.5.2)  and  (4.5.3)  together  imply  P(0^  e  $n)  -*■  1  as 
n  -*■  00 .  Since  m  is  finite 

m 

(4.5.4)  P (D  (6,  e  9  ))  -*•  1  as  n  •>  <»  . 

i-1  1  n 

Now 

m 

n  (0.  e  9  )  <=^>  0C  $ 
i-1  1  n  n 

Also,  C  0  with  probability  one.  Therefore 

(4.5.5)  P(0  -  9  )  -*>  1  as  n  •*■<*>  . 

n 

Note  that  since  4^  C  0  with  probability  one,  the  only  way  in  which  data 
can  be  misclassif led  is  to  have  some  6^  not  belonging  to  So 

P(misclassif ication  of  data) 

-  P(  U  (0*  t  *  )) 

i-1  1  n 

m 

-  P((A  (9*  e  9))C)  +  0  by  (4.5.4)  . 

i-1  1 

Thus,  we  have  proven  (4.5.1). 
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If  m 


being  a  finite  set  cannot  equal  0  which  is 


m  oo 

> 

countable.  So 


P($n"0)  -  0  for  all  n  . 


However,  we  have  the  following  result:  Given  e  >  0,  there  exists 

9.  ,0  ,...,0  such  that 

1  2  m 


I  >  i-e 
j-i  *3 


I  q.  >  1-e  . 

j-1  j 


Let 


.9,  } 


» 


then 


P($  Z2  $  )  -*-1  as  n  -*■  00  . 
n  e 

4.5d.  G  discrete,  m  finite  -  further  reduction. 

We  already  mentioned  that  we  can  regard  the  "continuous"  data  as  a 
sample  from  h£,  using  (4.5.1)  as  justification.  We  also  mention  that 
we  can  then  apply  all  the  methods  we  discussed  in  chapter  III  to  the 
"continuous"  data.  In  sections  1.9  and  3.6,  we  see  that  if  we  know 
the  support  of  G,  then  the  computations  of  the  MLE  and  ainlmua  ditu*  . 
estimate  are  much  simplified.  Moreover,  they  give  far  better  mi  tar  •  - 
than  Anderssen  and  Jakeman's  procedure  which  could  not  tee*  .« 

8« 
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the  additional  piece  of  information.  While  it  ie  quite  laposaible  to  deter¬ 
mine  the  support  of  G  from  random  plane  data,  in  our  present  case,  (4.5.5) 
tells  us  that  if  n  is  sufficiently  large,  then  with  large  probability, 
the  support  of  G  can  be  determined  from  the  data.  In  other  words,  thin 
slice  data  contain  more  Information  than  planar  data  and  the  additional 

information  can  be  put  to  good  use.  Essentially,  we  are  using  (4.2.15)  to 

m 

determine  0, ,0-,...,0  ,  the  support  of  G  •  7  g.  and  (4.2.16)  to 

12m  i-1  1  8i 

determine  gi»g2» * • • *8^* 

4.6.  Miscellaneous 

In  this  section  we  state  a  number  of  lemmas  which  will  be  used  in 
section  4.7,  their  proofs  can  be  found  in  A. 3  of  the  Appendix. 

Ishibis  4.6.1.  With  probability  one,  ||Hn-H||  *►  0. 

Lemma  4.6.2.  With  probability  one,  Rh^-H^R  ■*  0. 

Lemma  4.6.3.  With  probability  one,  *  °* 

Lemma  4.6.4.  For  almost  every  sample  sequence,  there  exists  a  Q*  e 

for  each  n  such  that 

A 

Q*  -*Q 

and 

Q*(e±)  -  0*0^)  -  Q(0j)  -  Q(01-) 
for  n  sufficiently  large. 

Note:  See  Section  4.3d  for  the  definition  of  Q^. 

Lemma  4.6.5.  For  almost  every  sample  sequence,  let  Q*  be  as  defined  In 

i_  *  o. 

Note  that  Q  ■  G(*|t),  F^  ■  H(*|+)  and  we  are  in  the  thin  slice  case. 


In  1 


a  4.6.6.  For  almost  every  sample  sequence,  let  Q*  be  as  defined 


Leasts  4.6.7.  For  almost  every  sample  sequence,  let  Q*  be  as  defined 


in 


4.6.4 


Lemma  4.6.8. 


Lemma  4.6.9. 
there  exists 
that  the  atat 


If  F„  *  Fq,  then  *  Q. 

If  G  is  discrete,  then  for  almost  every  sample  sequence 
<3£,  Q*  discrete,  puts  weights  at  ^’^‘"'^mCn)  8Uch 
ments  of  leosaas  4.6.4,  4.6.5,  4.6.6  and  4.6.7  still  hold. 


9 


4.7.  Minimum  distance  method. 

4.7a.  Description. 

Procedure  1.  minimizes  || Fq  - hjj  among  all  Q  E  Q^. 

This  procedure  is  a  linear  programming  problem  with  4l(n)  + 1  equations 
excluding  the  nonnegativity  constraints  and  l(n) + 1  variables. 

Procedure  2.  Among  all  Q  e  Q^,  minimises 

S^-Vi!  +  ll^*Bnd« 

where  +  ■  Fn  is  a  decomposition  c 2  Frt  (see  section  4.3d). 

Q,a  Q  Q 

This  is  a  linear  programming  problem  with  4n  +  2m(n)  + 1  linear  constraints 

c 

and  i(n)+2  variables. 

Procedure  3.  Among  all  Q  e  Q^,  minimizes 

«VV  - 


91 


Not*:  The  eecoad  Integral  can  ba  rewritten  aa 


*  n. 


T  li  t  l  a  ~L\2  t 
J 1  n(^qi  v)  I(n1>l} 


where  ^  >  0  ia  the  weight  Q  put  at  0^. 
This  la  a  quadratic  programing  problem , 


4.7b.  Cona latency  of  Q^. 


For  procedure  1.  For  alaoat  every  aaaple  aeauence.  let  Q*  be 


ee  defined  In  lame  4.6.4.  By  lama  4.6.5  - F_||  +  0.  By  lame  4.1 

Tl  ^ 


|| Hn  -  Fq||  ■*  0.  Combining,  jj  Fq*  -  Hq||  ♦  0.  By  definition. 


This  lmpllea  || F~  -  F_  ||  0.  By  lemma  4.6.8  8  -*•  Q. 

On  y  11 


For  procedure  2.  For  alaoat  every  aaaole  aeauence.  let  Q*  be 
ee  defined  In  leme  4.6.4.  By  lenmea  4.6.6  and  4.6.7 


-  Hd||  *  0  . 


Theae  together  with  lemma  4.6.2  and  4.6.3  Imply 


fron  which  vc  conclude 


(4.7.1) 


II -0 


by  definition  of  Q^. 

(4.7.1)  together  with  lanes  4.6.2  end  4.6.3  inply 

(4.7.2)  ||f£>c  -  Hj|  +  |F^>4  -  Bd||  -  0  . 

Since 


H  ■  H  +  H .  ■  F 
c  d  Q 


(4.7.2) 


W 


80  k7 

For  procedure  3.  We  assume  m  <  00  in  addition.  For  elnost  every  sanple 
sequence,  let  Q*  be  defined  as  in  Ion  4.6.4 


(4.7.3)  |(F^6(f)  -  Hm(,))2  4^(7) 

<  -  H^fl2 .0  b)l»  4.6.3  «t  1—  4.6.7. 


Let  Q*  put  nass  qj  at  61.  By  construction,  q*  -  q±  for  n 
sufficiently  large  (loan  4. 6.4), and  ^  -►  h  -  -Jg-  q£  by  the  strong 


law  of  large  wabtri.  Thar  afore,  since  m  is  flaita 


(4.7.4) 


n 


V 


I{»t>D 


*  I. 


i-1 


Vhi 


0  . 


(4.7.3)  and  (4.7.4)  together  imply 


By  definition  of  ^ 


That  is 


*<*(£•  V  -  0  • 

d(P*  ,  H  )  *►  0  . 

Si  n 


(4.7.5)  |(F^c(y)  -  h^Ct))2  ^(y)  ♦  0 

and 


(4.7.6) 


n  n 


n.  2 


Jl  a  **  "  ^ni>  1J 


where  q,  >  0  is  the  ness 


puts  at  . 


Making  uae  of  the  Inane  on  P.  453,  Choi  and  Bulgran  (1968),  it  follow*  from 
(4.7.5)  that 


(4.7.7) 


||?£.  -■.»*«• 


It  follows  iron  (4.7.6) 


Therefore 


ql*qi  .lace  -H.hl-^g-qi 


C  _  T  a  T  . 

“i  T+01  qi  T46±  qi  “  hi 

«d  *  X  Ve± 

’  k  S±  1{”i>1,{61  ‘ 


Since  m  la  finite 


|F^4-Hdll*0 


It  follova  from  this  and  (4.7.7)  that 


||Fe  -n|j-0  . 


& 

By  leu  4.6.8.  — ♦Q 


4.7e.  Cona latency  of  8  . 

_ _ n 


We  have  already  proved  the  conaletaacy  of  -  Gn(*|t)  in  section 


4.7b.  Fton  (4.1.1) 


G(e)  - 


f 


Since  ea  a  function  of  £  la  contlnuoua  and  bounded.  By  Theoree 


By  (4.2.4) 


0.25 

"  0.25  +  3 


0.2 


0.2 

13  * 


Staple  size  n  »  500. 

We  compare  eight  different  method*,  an  estimate  la  computed  only 
when  nj>  1,  1  •  1, . . .  ,5,  l.e. ,  when  we  can  successfully  determine  6 
from  the  data. 

The  eight  methods  are  described  below: 

1)  By  (4.2.6)  g4  «  h^.  So  estimate  g1  by 


A 


9  1  ®  1,  .  .  .  ,  5  . 


2)  Consider  the  "continuous"  data  as  planar  data*  use  MLE  with  known  support 

3)  Consider  the  "continuous"  data  as  planar  data,  use  Choi  and  Bulgren's 
approach  with  known  support. 

4)  Average  of  1)  and  2). 

5)  Average  of  1)  and  3). 

n.  n 

6)  Weighted  average  of  1)  and  2).  The  weights  being  —  and  —  . 

n  n 

7)  Weighted  average  of  1)  and  3). 

8)  Procedure  3  of  section  4.7.  We  use  the  simplified  version  (see  Section 
4.7d) . 

The  results  are  aa  follow:  In  94  out  of  the  100  trlsla,  9  can  be 
determined  from  the  data,  l.e.(  n  >  1.  1  -  1,...,5. 


TABLE  21 


A  comparison  of  the  8  methods  of  estimating  the  probability  mass  function 
P(8)  of  the  size  distribution  G(0)  which  puts  mass  0.2  at  6  ■  1,2,3, 4, 5 
In  terms  of  m.s.s.  and  the  average  of  the  estimates  over  94  samples  of 
slss  100. 


e 

P<6) 

1.00000 

0.20000 

2.00000 

0.20000 

3.00000 

0.20000 

4.00000 

0.20000 

5.00000 

0.20000 

METHOD 

M.S.E. 

1 

0.21215 

0.00393 

0.19677 

0.00298 

0.19859 

0.00509 

0.19768 

0.00354 

0.19481 

0.00441 

METHOD 

M.S.E. 

2 

0.19588 

0.00151 

0.19857 

0.00101 

0.20474 

0.00084 

0.20010 

0.00065 

0.20070 

0.00041 

METHOD 
M.S .E. 

3 

0.20304 

0.00180 

0.19455 

0.00145 

0.20629 

0.00111 

0.19864 

0.00093 

0.19748 

0.00056 

METHOD 

M.S.E. 

4 

0.20401 

0.00135 

0.19767 

0.00105 

0.20166 

0.00131 

0.19889 

0.00111 

0.19776 

0.00132 

METHOD 

M.S.E. 

5 

0.20759 

0.00143 

0.19566 

0.00113 

0.20244 

0.00146 

0.19816 

0.00120 

0.19615 

0.00136 

METHOD 

M.S.E. 

6 

0.19717 

0.00130 

0.19843 

0.00089 

0.20411 

0.00070 

0.19998 

0.00059 

0.20031 

0.00041 

METHOD 

M.S.E. 

7 

0.20379 

0.00155 

0.19471 

0.00126 

0.20553 

0.00094 

0.19863 

0.00084 

0.19734 

0.00054 

METHOD 

M.S.E. 

8 

0.20406 

0.00172 

0.19341 

0.00144 

0.20608 

0.00108 

0.19622 

0.00111 

0.20023 

0.00099 

Count:  Method  1,  which  uses  just  the  "discrete"  dete  is  dearly 
inferior  to  methods  2  end  3  which  regard  the  "continuous"  data  as 
planar  data.  Methods  2  and  3  are  themselves  comparable.  Method  8 
seems  to  perform  hatter  than  method  1  but  not  as  good  as  methods  2 
and  3. 

Estimate  4.  The  simple  average  of  estimates  1  and  2  is  no  better 
(maybe  even  worse)  than  estimate  2.  Similarly*  estimate  5*  the  simple 
average  of  estimates  1  and  3  is  no  better  than  estimate  3.  However* 
if  we  take  the  weighted  average*  there  is  considerable  improvement, 
giving  us  the  best  results.  (Compare  method  6  with  method  2,  method  7 
with  method  3) . 

Treating  the  "continuous"  data  as  planar  data,  we  are  back  to  the 
random  plane  case*  so  a  comparison  between  the  performance  of  the  proce¬ 
dures  w*  proposed  with  the  classical  procedures  is  already  given  in 
section  3.6. 

Recall  also  that  in  cases  1  and  2  of  section  3.6*  we  assume  we  know 
0.  While  this  assumption  is  quite  unrealistic  in  the  random  plane  case* 
In  our  present  thin  slice  case*  in  94  out  of  the  100  trials*  9  can 
actually  be  determined  from  the  "discrete"  data. 


APPENDIX 


A. I.  In  •action  3.3,  ve  give  four  conditions  (d-c4)  which  characterise 
h(*|+)  in  the  rand  on  plana  case.  The  sample  c.d.f.  H  does  not 

Q 

belong  to  the  admissible  range  {H(«|+)).  Since  the  Anderssen  and 

Jakeman  product  Integration  estimate  B  is  just  a  smoothed  version 

of  Hft,  there  is  no  reason  to  believe  that  5^  will  belong  to 

{H(* |+)}.  To  illustrate  this,  we  trill  find  a  G  such  that  P(h  violates 

n 

c4)  >  P  for  all  n  sufficiently  large,  and  for  some  P  >  0. 

Consider  yi»y2****»yn»  i’l'd.  from  the  density 


*><yl  +>  -  I  --  ■  r  —  dG(8)  , 

‘y  /Sv  G 

arrange  them  in  increasing  order  as  y  .  <  y  «<...<  y  ,  define 

n9i»  nfz  ntn 

yOf0  “  0*  We  will  consider  piecewise  linear  Interpolation,  the  simplest 
kind  of  product  integration  estimate.  That  is. 


Vy) 


As  it  stands  now,  hR  violates  cl  of  the  characterization  of  section  3.3. 
However,  this  problem  can  be  fixed  easily,  for  example,  by  applying  quadratic 
Interpolation  to  points  adjacent  to  the  origin.  But  if  c4  is  violated  which 
is  the  same  as  saying  that  Gq  Is  not  a  nondecreasing  function,  then  there 
is  no  getting  around.  We  will  find  a  G  such  that  P(hQ  violates  c4)  >  P 
for  all  n  sufficiently  large  and  for  some  P  >  0. 


f (6)  - 


£’(«)  - 


-1 


2vCT 


f"(6)  - 


-1 


4(a-fi) 


37 


Silica  6-6  .  +  e  , 
nt2  n,l  n,l 


y/a-6  <  /a - 

2 


2  a2 
y  -  8  _ 

J  nt2 


2  2 

y  -0  .-20  ,e 

n,l  n,l  n,l 


Apply  fact  1  with 


a  "  y2  “6?  i,5  -  20  ,e  .  +  e2 

n»l  Hjl  iij 


K 


”  “  e»,2  >e 


«.3 


fl2  .2 

n.2  "n.l 


n,2 


2(y2-8nfl> 


±-  f9». 

'n,2  '0 


e-  3  q2  2-e2 

n,2  n.l 

n,2  2(0 2  ,-02 

n»3  n. 


Thua  K  >  I  +  J  if 
n  n  n 


(1) 


6?  «  -  02 


2e. 


1_  _  p»2  **  n.l  rn,3  _ 1 

"•2  9n.3-en,l  K,2  /72 


n»2 


Next, 


du 


log(u  +  /u  -a  ) 


P» 


so  that 


— — —  *  log(v  +  /v2-a2)  -  log  a 

/  2  2 
ru  -a 


Thus  (1)  becomes 


fl2  a2 

i  9_  a ® 9 

i _  U,4  n. 


2e  2  a2  fl2 
“»2  0n,3  0n,l 


1  <l#tttB.3  +  ^M^.2)'10* 


rr  <l08(en.2 +  - 108  9».i>  • 

nt  a 


f(x)  -  log(a-fx)  ,  x  >  0 


f'(x)  - 


f"(x)  - 


f’"(x)  - 


(a+x) 


Ua  have  the  following 

Fact  2:  log(a-tx)  <  log  a  +  ~  x  . 


Fact  3  s  log(a4x)  >  log  a  +  —  x  — ~  x2  . 

a  2a^ 


Apply  fact  3  to  the  left  hand  side  of  (2)  and  fact  2  to  the  right  hand  side 
of  (2) ,  we  see  that  (2)  Is  satisfied  If 


1  ®n2"8nl  /ef  2 

<2  <3-<l  *± _ >-3 


<3-<2>-le»  V2> 
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Let  G(6)  have  support  on  [0,1]  so  that 


also  has  support  on 


[0,1] .  Than  with  probability  one 


6_  ,  1  t  8  _  ->■  1  ,  ,  -*■  1 

upX  Uj  *  U|  J 


en,l  -  0  •  en,2  *  0 


Note  that 


2 

n,2 


)<e 


2 

n.l 


)(0 


Taking  the  liait  In  (4),vu  have 


>  7-7  <en,l  +  /2en,l>  • 
“»A 


Since  eQ  1  -►  0,  en  2  ■*“  0  with  probability  one,  i/2en  2  dominates  the 

bracketed  terns  on  the  left  hand  side  and  «5e  7  on  the  right  hand  side 

n,x 

So,  we  can  look  at 


2e  ,  e  ,+e  ,  en,2  e  .  n,l 

n,2  n,l  n,l  n,l  • 


-Ai-  >2/-^ 

*n,i#<n#2  *  en,l 


(6)  la  satisfied  if 


<l/Sbk 

en,l  en,2 


■T""1--  >  c  for  c  sufficiently  large,  actually 

».2 


c  »  9  will  do.  So  P(5  violates  c4) 

n 


>  Pd^+J,,  <  o 

n  n  n 


>  p(- n«  >  c)  if  G  has  support  on  [0,1]  . 
n,2 


How,  if  y^.yj, •  ••  »yQ  are  l.l.d.  according  to  the  uniform  distribution 
on  [0,1],  than 


15 


Than 


n,n  n,n-l  ' 


n.n-4 


*(2)^(1) 
vhara  x^,*2  i.l.d.  ^  U(0,1) 


Y  -Y 
n,n  n,n-l 


X<2)"*(1) 


P(ia*i  >  C)  .  P(_ — iil.  > 

en,2  *(2)“*(1) 


c)  -  P  >  0  . 


But  th«  uniform  danalty  on  [0,1]  violates  el  and  banco  doaa  not  correa- 
pond  to  any  G. 

However,  taka 

,(e|t)  -  -ssL  o  <  e  <  l  . 

Than 


lot  C  -  e2,  dg  -  26  d6 


“■“"'©‘I' . 


cytaln”1^!)  -  •in“1(0)) 


Ty 


for  0  <  y  <  1 


Thu.  h(l-|f)-5|. 

Than  If  1.1*4.  according  to  tha  abova  H(»l+) 


€n  l  To  n  i~Yn  n^2  E~1(Un  ^1)  ~H~1(Pn>n_2) 
en.2  Yn,n“Yu,n-l  n>  “  ®"1(Da,n-l> 


vhara  u, ,u„,...,u  1.1.4.  ^  U(0,1)  aa4  u  ,<•••<  u  _  thalr  oxdarad 

1  *  U  H|X  Ugll 

valua.  But 


a  <Un.n>-B  <D».a-l) 


Vn-rVn-S  *Vil« 


Sine*  V.  . 

“i* 


Sine* 


■*  1,  w  ,  -*■  1  with  probability  ona  and  h(l-|+)  la  finlta 
n»2 

b(W  J+) 

^  ■*  1  with  probability  1 
n»l 

iis  >  c)  -  iis  ?(-ataiL-j&«a=g.  >  c) 

n  en,2  n  n,n~  n,n-l 

-  P  >  0  . 

P(hn  vlolataa  c4) 

>  *<VJ.  *  S> 

e  . 

>  P (r5^  >  c)  ♦  P 

“.2 

P(hQ  vlolataa  c4)  >  P  for  n  sufficiently  large. 


J  J 


K' 

v  4 


A. 2.  Proof  of  1— as  3.5.1  and  3.5.2. 

Wo  first  prove  the  following  leans. 

Leans:  Let 

*1**2*  *  *  *  **n  ***  ^hiy|t)  ■  J  — -  •L  —  <kj_  (8 1 1)  • 

'l  8 

Let  A  ■  inf  {8:  6(6)  “  1},  possibly  <*>  then  elnost  every  eenple  sequence 
{yfc}  la  dense  in  (0,A). 

Proof.  The  definition  of  hQ(y|+)  end  A  together  laply 
(1)  *»o(y|+)  >0  for  ye  (0,A)  . 

Let  Q  -  the  set  of  positive  rationale 

Q*  -  Q  n  (0,A) 

J  -  Uq-r.q+r),  q  e  Q*.  r  e  Q)  , 


let  (a,b)  e  J 

(1)  -o-  P(a,b)  >  0 

->  for  alaost  every  sample  sequence  (y^J 
{yfc}  ft  (a,b)  4  +  . 

Since  J  is  countable,  we  conclude  that  for  almost  every  sample  sequence 

{yk>* 

(yk)  ft  1  4  ♦  nt  J. 
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Sine*  Q*  la  dense  In  (0,A),  {yk  }  is  also  danse  In  (0,A). 

L—a  3.5.1.  For  almost  every  sample  sequence,  there  exist  e  Q*eQ 


for  each  n  such  that 


&  ff 

Q*  —  Qq  end  F  — 


F.  . 
% 


Proof.  By  the  preceding  leans,  we  know  that  almost  every  sample  sequence 
{yk>  is  dense  in  (0,A) .  For  such  a  sample  sequence,  define  e  sequence 
of  distribution  functions  Q*  as  follows: 

Q?  is  dl8crate  with  weights  at  y  ,<y  „<...<  v 
“  ^  'n,l  Jnt2  7n,n 

and 

QS(yn>1)  -  Q0(ynl>  ,  i  -  l,...,n-l 
W’n,n>  *  1  ■ 

Of 

Since  {yfc}  is  dense  in  <0,A),  0*“^%.  As  a  function  of  0,  F0(y)  is 

continuous  and  bounded.  So 

ro»-  \  *  |  Te  ■%  ■ 


lo— a  3.5.2.  For  almost  every  sample  sequence,  there  exists  a  q*  e 

J0r 

for  each  n  such  that  QJ  — ►  Qq,  F^  -5-  and 


r  i  «««>  -  r  \  dV» . 


Proof.  For  almost  every  sample  sequence,  we  claim  that  the  same  sequence 
Q*  defined  in  the  preceding  proof  would  also  work  here.  The  only  thing 
that  is  left  to  be  proved  is 


C  ?  d<«<»  *  C  $  dQ°<o  • 

The  proof  is  by  a  several  e  argument.  Since  /*  ■§■  dQQU)  <  00 ,  by 
problem  2,  P.  43  of  Chung,  there  exists  a  <  b  <  c  <  d,  points  of  continuity 
of  Qq  such  that 


(2) 

•j.  T  4V£>  -  j 

I 

0  5 

dQ0(O|  <  e 

(3) 

'C  *  dQo<5>  -  1 

*  1 

0  * 

dQ0U>|  <  2e 

Let 

«n  ■  *l«{yks  1  £  k  £  n,  yfc  >  a> 
dn  -  max{yk:  1  £  k  £  m,  yk  <  d}  . 


Since  {yk>  is  dense 

<*)  £  X  4Qi(£)  -  f0  T 

■  I;  T  4,««>  -  |‘n+  t  4V£> 

+  f *  *  iQ*<0  -  f**  i  40,(0 

ii  n 

+  C+T4,SK)'  j]+  T 4V£)  • 


►  .  A-. 


'» .«*.  •*.  »’.  »*.  %*«.►'.  »*.  <•*.  •"  “’■  >"  -  /  •“  »*  »*  •  •*  •’  •  j  *  *  *  *.“-»  *-'*■’  ’•**•*■ 


k;' 


6 


> 

K‘ 


i-:-. 


Since  a  +  a  for  n  sufficiently  largo 

u 

d  +d 
o 


(5)  |*“+  j  dQ*<0  <  Ja“+  |  dQ0(5)  by  definition  of  Q* 


<  2e  by  (3) 


(6) 


(7) 


!  |  dq  (O  <  2e  by  (3)  . 

Jd  +  % 

n 

f  ±*£«>  </<  2c  without  lose  of  generality 


On  the  closed  Interval  [a,d] ,  j  is  uniformly  continuous,  so  that 


(8)  there  exists  5  such  that  if  x,y  e  la»d], 

|*-y|  <  2d,  then  -  ~|  <  €  • 


If  we  divide  the  interval  [a,d]  into  subintervals  of  equal  length 
d*  <  d,  then  since  {yfc}  is  dense  in  (0,A) 


(9) 


for  n  sufficiently  large,  there  will  be  soee 
aaong  y ^ , y^ » • • • , yn  lying  inside  each  of  the 
eubintervsls. 
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«X  mi  »-■  m£ 


< 


Let  y  j  <yn  2  <  •••  <yn>n  be  yl*y2*****yn  ®rran8®d  increasing 


order.  Define  ±n>  jQ  by 


*n  *  *a.i. 


d  ■  y  . 

®  n»J. 


Then 

r  i  f  ^_+  i 

(10) 

< 

-  i\  }«•*' 

n  n 

j  -1  + 

•I  r°'*tl(5i<1Qa*«)-IdV5» 

n  yn,» 

By  deflniclon 

+ 

i 

f.r1 

and 


rr1  t4vo 

yn,a 


_1_ 
:ntm 


where 


n.a 


y* 

'n,m 


<  y. 


n,m+l 


So  (10)  becones 


(ID 


W0  -  f*  i  d%«> 

n  n 

i‘i 

•  I  <r— -  3T-)  <Q»(y.  >  -  «n<y.  .»  • 


For  n  sufficiently  large 


26 


so  that 


fci —  "  I  <  e  by  (8)  . 


yn,a  yn,a+l 


Making  use  of  this  in  ( 11 ),  we  get 


(12) 


CC  T  -  f*  \  4W» 


<  e 


Apply  (5).  (6),  (8),  (12)  to  (4) 


if  ^  4Q*(£)  -  j"  *%(Q  |  <»c 


for  n  sufficiently  large,  e  Is  arbitrary,  so 


A.3.  Proofs  of  the  lemmas  of  gee t ion  4.6. 

Lemma  4.6.1.  With  probability  one,  [[ H^— H||  0  . 

Proof.  This  is  a  well  known  property  of  the  sample  c.d.f.,  see  P.  133,  Chung. 


4.6.2.  With  probability  one,  0  . 


Proof.  By  the  strong  law  of  large  numbers,  for  each  i 


W  -  H„d(9r)  *  Hd(0i)  -  VV* 


Next,  we  will  prove  for  each  t,  including  t  *  00  , 


H^t)  -►  Hd(t)  a.s. 


Since 


H4(c) 


l  \ 

i  1 


"ii* 


given  e  >  0,  there  exists  0.  ,0.  ,...,0.  <  t,  k  finite  such  that 

11  x2  xk  “ 


1  I  “  -  Hd(t> I  <  e  . 

3-1  3 


Since  k  is  finite,  for  almost  every  sample  sequence,  {0  }  C 

*3  3-1 

n  sufficiently  large. 


$  for 
n 


So,  for  almost  every  sample  sequence, 


1  n  ^  n 

«  Ji  1r {ei . V  1  Hnd<t)  " » i-i 1  v{V  ek  i tl 


for  n  sufficiently  large 


By  the  strong  lav  of  large  numbers 


k 

L.H.S.  of  (A)  -*■  £  h.  a. 8. 

J-1  i 


R.H.S.  of  (A)  -► 


l 

1 

01<t 


hl" 


Hd(t) 


Since  e  is  arbitrary  In  (3) 


Hnd(t)  ♦  Hd(t)  a. s. 

In  particular. 


(5)  /H^  -  / Hj  a.s. 

So  if 


H. 


H 


nd 


nd  /  H 


nd 


Hd 


(1),  (2),  (5)  together  imply 


®nd<V  -  Hnd(6r>  *  W  -  W* 


a.s 


Hnd(t)  Hd(t) 
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So,  by  the  1 


on  P.  133,  Chung 


ll»nd  *  “dll  *  0 


i.e. 


Hnd  Hd, 


II7^j-7hjII’0  *  • 


HHnd  “  HdH  0 


4.6.3.  With  probability  one,  [[h^  -Hc||  •+•  0  . 


Proof.  Direct  consequence  of  lemma  A. 6.1  and  lemma  4.6.2. 


Lemma  4.6.4.  For  alaost  every  sample  sequence,  there  exists  a  Q*  e  Qn  for 
each  n  such  that 


and 


o^e^-o^er)  -  qce^-QCe^) 


for  n  sufficiently  large. 

I 

Proof.  The  idea  is  essentially  the  sane  as  In  the  proof  of  lemma  3.5.2. 
There,  we  show  that  almost  every  sample  sequence  is  dense  and 

we  require  q£  to  agree  with  Q  at  y^y^ . .  •  »yn*  Here,  all  we  need  to 
do  is  to  add  an  extra  requirement 

Qj($i>  -  QC4±>  -Q<*t-> 


for  e  t 


Slues  e  *oo>  *  1  by  (4.5.3).  For  almost  every  sample  sequence, 

0.  6  4  for  n  sufficiently  large  so  that 
x  n 

(6)  QjjCQj)  -  Q*(0±-)  -  Q(0t)  -  0(0^) 

for  n  sufficiently  large. 

T i— ns  4.6.5.  For  almost  every  sample  sequence,  let  Q*  be  defined  as  in 

Proof.  (4.3.2)  and  (6)  together  Imply 

pq.<V  *  pq.(0i->  *  W  -  yer>  • 

As  a  function  of  y,  FQ(y)  has  a  jump  at  0  »  y.  It  is  otherwise  continuous 
and  11m  F0(y)  ■  0.  So  If  y  is  a  point  of  continuity  of  F_,  by  a 

0-H»  0  " 

slight  extension  of  Theorem  4.4.1,  P.  87,  Chung 

Fo*(y)  ■  /  F0<y>  dQ£<e>  *  /  F0(y>  dQ(0)  -  FQ(y)  • 

Again,  by  the  lenma  on  P.  133,  Chung 

H,«S  ‘  'o'1  * 0 ' 

Lent  4.6.6.  For  almost  every  sample  sequence,  let  Q*  be  as  defined  in 


Mote:  See  section  4.3d  for  definition  of 
Proof.  Fro*  (4.3.2) 


(7) 


By  construction,  Q*  puts  ness  et  8i  if 
becomes 


<8>  F«S'd  ‘  Ji  lS>1}  !ei 

Hr-hi*w^'1i-,''*i>l  for  “ 

A  comparison  of  (7)  and  (8)  then  tells  us 

<9>  F^a<ei>  -  p$*),d<9i-)  -  W  -  W5 

'^’i  for  “ 


Given  t. 


«d(t) 


9^  t 


n^  >  1  so  that  (4.3.3) 


sufficiently  large  . 


sufficiently  large. 


Given  e  >  0,  there  exists  9. 


6.  <  t,  k  finite  such  that 


Sines  k  is  finite,  fox  n  sufficiently  large 
eo  that 


n.  >  1,  j 
J 


Ct)  <  Hd(t)  . 


Since  e  la  arbitrary  In  (10) 


»$4<e>  *  M**  • 


This  and  (9)  Imply 


Hdll  -  0  . 


4.6.7.  For  almost  every  sample  sequence,  let  Q*  be  as  defined 


In  lemma  4.6.4, 


ll'^c  -M-°  • 


Proof.  Direct  consequence  of  Inane  4.6.5  end  liana  4.6.6. 

a 


jg|  Of 

4.6.8.  If  F„  — • ►Fq,  than  — +Q 


Proof.  Sines  F 


% 


Fq,  if  t  Is  e  point  of  continuity  of  Fq, 


(11) 


F«  (t) 


f  Ftt(t)dQ_(e)  Fn(t) 


r  Fft(t)dQ(6) 


L«t  be  any  aubaequance  of  such  that  — ►  Q*  whore  Q*  ia 

a  aubdlatrlbutlon  function,  wo  will  bo  dona  If  wa  can  ahow  Q*  ■  Q. 
Suppoaa  t  la  alao  a  point  of  continuity  of  Q*,  than  aa  in  the  proof 
of  la«ui  4.6.5,  by  extending  Theorm  4.4.1,  P.  87,  Chung  alightly 


f0  V*>  d\<»  *  f  V‘>  dQ*(0)  . 


Cooper! non  with  (11)  yielda 


(12)  £  F0(t)  dQ(8)  -  F0(t)  dQ*(0) 

Since  11a  F_(t)  -  1,  by  aonotona  convarganca 
t-*«  0 

lo  dQ<0>  ■  lta  J*  F0(t)dQ(8) 

-  lia  I  Fft(t)dQ*(8) 
**•  JO  0 


-  I  dQ*(8)  . 
70 


So  Q*  111  distribution  function.  Since  (12)  holda  whenever  t  ia 

a  point  of  continuity  of  both  Fq  and  Q*  and  the  aat  of  auch  point  a 

Jff 

la  daaaa,  Fq-Fq*,  ao  Q-Q*  by  idaot  if  lability.  Therefore  — *Q  . 

Laaaa 4. 6.9.  if  6  la  diacreto,  than  for  alaoat  ovary  aaapla  sequence. 


there  axlat  Q*,  Q£  discrete, puts  waighta  at  •  *  ’  ’^a(n)  Mcta  tlMKt 

the  atataaanta  of  laanae 4.6.4,  4.6.5,  4.6.6,  4.6.7  atlll  hold. 

Proof.  The  proof  la  alallar  to  the  proof  a  of  laaaaa  4.6.4,  4.6.5,  4.6.6, 
and  4.6.7  and  will  be  oalttod. 
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^  Spherical  particles  art  disparted  readonly  In  e  three-dimensional 
body.  The  contort  of  tbt  ephoret  art  distributed  according  to  a  dilute 
Poisson  process.  The  rsdll  of  such  sphsrts  have  a  distribution  G 
Independent  of  everything  else.  A  random  probe  (line,  plane  or  thin  — 
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•lice)  is  cut  through  the  volumes.NTha  observations  y  are  the  following: 
i)  A>r  the  rand oc  line,  its  Intersection  with  a  sphere  is  a  line  segment, 
y  is  half  the  length  of  Chat  segment,  ii)  For  the  random  plane,  y  is 
the  radius  of  the  eirele  of  intersection,  ill)  For  the  thin  slice,  y 
is  the  maximum  of  the  radii  of  the  circles  of  intersection.  The  problem 
is  to  estimate  G  from  the  observations  y, , . . .  ,y  . 

Past  procedures  make  use  of  the  inversion  formula  that  expresses  the 
particle  size  distribution  G  as  a  function  of  the  distribution  H  of 
the  observed  data.  An  estimate  G  is  obtained  by  replacing  H  in  the 
formula  by  an  estimate  H  where  H  la  either  the  sample  c.d.f.  or  e 
smooth  version  of  it.  These  procedures  do  not  take  the  structure  of  the 
problem  into  account.  Consequently,  they  have  some  serious  shortcomings. 

"^•Taking  the  viewpoint  of  nonparametric  estimation  of  mixing  distribu¬ 
tions,  we  propose  a  new  procedure  that  deals  with  the  shortcomings  of  the 
classical  procedures.  We  consider  linear,  planar  and  thin  slice  data. 

In  all  three  cases,  our  approach  performs  better  than  the  classical 
procedure.  In  addition,  we  prove  consistency  results. 

In  the  random  plane  case,  we  discuss  the  right  way  and  the  wrong  way 
to  bootstrap  the  distribution  of  a  stereologlcal  estimate,  corresponding 

to  whether  we  have  taken  the  structure  of  the  problem  into  account  or 
not.  In  the  thin  slice  case,  when  G  is  mixed  or  discrete,  the 
formulas  Involve  a'  decomposition  of  1  Into  its  continuous  and  discrete 
component.  This  makes  the  estimation  problem  more  complicated  but  also 
more  interesting  especially  in  the  discrete  case.  We  propose  a  few 
procedures  which  involve  a  decomposition  of  the  data  corresponding  to 
that  of  H.  ^ 
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